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Abstract. Sumner's universal tournament conjecture states that any tournament on 2n — 2 
vertices contains a copy of any directed tree on n vertices. We prove an asymptotic version 
of this conjecture, namely that any tournament on (2 + o(l))n vertices contains a copy of 
any directed tree on n vertices. In addition, we prove an asymptotically best possible result 
for trees of bounded degree, namely that for any fixed A, any tournament on (1 + o(l))n 
vertices contains a copy of any directed tree on n vertices with maximum degree at most A. 
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Q ■ 1. Introduction 

1.1. Introduction. A tournament is an orientation of a complete graph. One of the most 
well-known problems on tournaments is Sumner's universal tournament conjecture, which 
was posed in 1971 (see e.g. [151 122]). 
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Conjecture 1.1. Let T be a directed tree on n vertices. Then every tournament on 2n 
vertices contains a copy ofT. 



The following simple example shows that the bound would be best possible: let G be a 
regular tournament on 2n — 3 vertices (so every vertex has n — 2 outneighbours) , and let T be 
a star with all edges directed outwards. Then the central vertex of T has n — 1 outneighbours, 
and so G does not contain a copy of T. 

A large number of partial results towards Sumner's conjecture have been obtained. Let 
f(n) denote the smallest integer such that any tournament on f(n) vertices contains any 
CN . directed tree on n vertices. So Conjecture 11.11 states that f(n) = 2n — 2. Chung (see |22j) 

observed that f(n) < n 1+ °^ l \ and Wormald [22] improved this bound to f(n) < nlog 2 (2n/e). 
The first linear bound on f(n) was established by Haggkvist and Thomason [6], who showed 
that f(n) < I2n, and also that f(n) < (4 + o{\))n. Havet [7] showed that f(n) < 38n/5 — 6, 
and then Havet and Thomasse |9j used the notion of median orders to improve this to /(n) < 
(7n — 5)/2. The current best bound is due to El Sahili [5J. 

Theorem 1.2 (|5j). Let T be a directed tree on n vertices. Then every tournament on 3n — 3 
^ ■ vertices contains a copy of T . 

h ; 

The conjecture has also been verified for some classes of trees, such as directed paths. 
Indeed, a classical result of Redei [18] implies that we can even find a spanning directed path 
in any tournament. 

Theorem 1.3 (|18|). For any positive integer n, any tournament on n vertices contains a 
directed path on n vertices. 



Thomason [21J proved a much stronger result, namely that whenever n is sufficiently large, 
every tournament on n vertices contains every orientation of the path on n vertices (this 
was a conjecture of Rosenfeld). Havet and Thomasse |10| showed that this even holds for all 
7. Reid and Wormald [19] also proved Sumner's conjecture for other (very restricted) 
classes of trees. Havet and Thomasse [9j proved that Conjecture 11.11 holds for arborescences, 
i.e. where T has a specified root r so that either every edge of T is directed towards r, or 
every edge of T is directed away from r. 
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We will prove an approximate version of Sumner's conjecture. We also prove an asymp- 
totically sharp bound for trees with bounded maximum degree. 

Theorem 1.4. Let a > 0. Then the following properties hold. 

(1) There exists uq such that for any n > uq, any tournament G on 2(1 + a)n vertices 
contains any directed tree T on n vertices. 

(2) Let A be any positive integer. Then there exists hq such that for any n > uq, any 
tournament G on (1 + a)n vertices contains any directed tree T on n vertices with 
A(T) < A. 

In [15], we prove Sumner's conjecture for large n. The proof relies on the results (and not 
just the methods) that we prove in this paper. 

Part (2) of Theorem 11.41 implies that Sumner's conjecture is true with room to spare for 
large trees of small maximum degree. The following example shows that (2) is best possible 
in the sense that the term an cannot be completely omitted: take a regular tournament H\ 
on 2k — 1 vertices, take an arbitrary tournament H2 on n — k — 1 vertices and obtain a 
tournament G on n + k — 2 vertices from Hi U H2 by adding all edges directed from H\ 
to H2. Also, let T be the tree on n vertices obtained from a directed path on n — k vertices 
by adding k extra vertices which all send an edge to the initial vertex of the path. Then G 
contains no copy of T. (We are grateful to P. Allen and O. Cooley for pointing out this 
example to us.) It would be interesting to know whether the term an can be reduced to a 
constant depending only on A. 

Another class of trees where Sumner's conjecture can be strengthened are trees with few 
leaves. The first result in this direction was proved by Haggkvist and Thomason [6]. Havet 
and Thomasse (see [8]) then proposed the following generalization of Sumner's conjecture. 

Conjecture 1.5 ([8]). Let T be a directed tree on n vertices with k leaves. Then every 
tournament on n + k — 1 vertices contains a copy ofT. 

Ceroi and Havet |3] proved that this conjecture holds for k < 3, from which they deduced 
that Sumner's conjecture holds for all trees with at most 4 leaves. 

For our proof of Theorem 11.41 we introduce a decomposition of an arbitrary tournament 
which searches for dense expanding subgraphs. We then introduce a randomized algorithm 
for embedding arbitrary trees into such dense expanding graphs. Both tools may be useful 
for other problems. For example, it would be interesting to know whether our methods can 
be extended to prove an approximate version of Conjecture 11.51 

1.2. Outline of the proof. The notion of a robust outexpander (which was introduced for 
dense graphs in [16J) is crucial to the proof. Informally, a digraph G is a robust outexpander 
if for any set S C V(G) which is not too large or too small, the number of vertices with many 
inneighbours in S is substantially bigger than \S\. Kiihn, Osthus and Treglown |16| showed 
that any robust outexpander G of linear minimum semidegree contains a Hamilton cycle. 
(Here the minimum semidegree is the minimum of the minimum indegree and the minimum 
outdegree.) Applying this to the 'reduced digraph' obtained from the Szemeredi regularity 
lemma, this implies that we can split most of the vertices of G into sets V\ , V2, ■ ■ ■ , V& so 
that the set of edges from Y- L to Vi + i for each i (addition of the indices taken modulo k) 
forms a quasirandom and dense bipartite graph. As we shall see, this structure is very useful 
for embedding trees. On the other hand, it is easy to show that if a tournament G is not 
a robust outexpander of linear minimum semidegree, then the vertices of G can be split 
into two parts so that almost all of the edges between the two parts are directed the same 
way (see Lemma I2.8j) . We shall then consider whether either of these two parts are robust 
outexpanders, and so on. 
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To begin, in Section [2] we shall define the concepts we shall use, and prove various lemmas 
which will be of use to us later on. Then in Sections and H] we show that Theorem 11.41 holds 
with the added condition that G is a robust outexpander of linear minimum semidegree. 
Indeed, in Section El we consider the case where the tournament G is a robust outexpander 
of linear minimum semidegree on (1 + a)n vertices, and T is a directed tree on n vertices 
of bounded maximum degree. As described above, we can split most of the vertices of G 
into clusters Vi, V2, ■ ■ ■ , Vk so that the set of edges from Vi to V%+i is quasirandom and dense 
for each i. Given this structure on G, one attempt to embed T in G would be to embed 
each vertex t £ T in the cluster either preceding or succeeding the cluster containing the 
parent t' of t, according to the direction of the edge between t and t'. However, for many 
trees this method will fail to give an approximately uniform allocation of vertices of T to the 
clusters of G, which we require for the embedding to be successful. Instead, we modify this 
method so that each vertex is embedded as above with probability 1/2 and is embedded in 
the same cluster as its parent with probability 1/2. We show that with high probability this 
randomised algorithm will indeed give an approximately uniform allocation of vertices of T 
to the clusters of G, and so will successfully embed T in G. 

In Section 0] we begin by strengthening the result from Section [3l showing that if T is a 
directed tree on n vertices of bounded maximum degree, and G is a tournament on (1 + a)n 
vertices whose reduced graph defined on the clusters Vi,...,Vk contains a Hamilton cycle, 
then we can embed T in G so that the vertices of a chosen small set H C V(T) are embedded 
within a specified set U C V(G). To do this, we embed all vertices 'far' from H by the method 
described above, which ensures that the vertices of T are allocated approximately uniformly 
amongst the clusters of G. The remaining vertices of T are instead embedded to ensure that 
every vertex of H is embedded within U. This result allows us to consider directed trees T 
of unbounded maximum degree. Indeed, we define for a tree T a 'core tree' T c , which has 
the properties that T c has bounded maximum degree, but each component of T — T c is small. 
This enables us to show that any tournament G which is a robust outexpander of linear 
minimum semidegree on (2 + a)n vertices contains any directed tree on n vertices. To do 
this, we again split most of the vertices of G into sets V\, V2, ■ ■ ■ , Vk as described above. We 
then choose subsets V- C V{ at random so that | |Jj V(\ is roughly equal to \T C \, and embed 
T c into these subsets (actually we first extend T c to an 'extended tree' T ext and embed T ext 
into these subsets), using the strengthened result for bounded degree trees to restrict certain 
vertices of T c to vertices of G with many inneighbours and outneighbours in G — \J t V[. Since 
each component of T — T c is small, this will allow us to embed the components of T — T c one 
by one in the unoccupied vertices of G to complete the embedding of T in G. 

It is a simple exercise to demonstrate that any transitive tournament on n vertices contains 
any directed tree on n vertices. In Section (H we prove an analogue of this for almost-transitive 
tournaments G. This means that the vertices of G can be ordered so that almost all of the 
edges of G are directed towards the endvertex which is greater in this order. We show that 
if G is an almost-transitive tournament on (1 + a)n vertices and T is a directed tree on n 
vertices then G contains T. 

Finally, in Section [6j we shall use the robust outexpander dichotomy to prove Theorem ll.4i 
Here we shall describe the proof of the first statement; the proof of the second is very similar. 
So let G be a tournament on 2(1 + a)n vertices and let T be a directed tree on n vertices. If 
G is a robust outexpander of linear minimum semidegree, then our results of Sections [3] and H] 
show that G contains T, as desired. On the other hand, if G is not a robust outexpander 
of linear minimum semidegree then we may split G into two parts as described above. We 
now examine the larger of these two parts. If this is a robust outexpander of linear minimum 
semidegree then we stop; otherwise we again split this part into two. We continue in this 
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fashion, always choosing the largest part of G, stopping if this is a robust outexpander and 
splitting it into two smaller parts if not. If we continue this process but do not find a robust 
outexpander of linear minimum semidegree, then G must be almost transitive. Indeed, each 
time we split G most of the edges across the split are directed the same way. So once all of 
the parts of G are sufficiently small, we can be sure that for some ordering of the vertices of 
G, almost all of the edges of G are directed according to this order. So by the result from 
Section EJ G contains T, as desired. 

So suppose instead that at some stage we stop because the largest part of G is a robust 
outexpander of linear minimum semidegree. Then we divide T into parts to be embedded 
amongst the parts of G, so that each part of G receives a part of T approximately proportional 
to its size. However, the robust outexpander part of G will actually receive slightly more 
vertices of T than it would from a proportional split. The results from Sections [3] and [4] 
guarantee that this part of T can still be embedded into the corresponding part of G. Since 
then the other parts of G will receive slightly fewer vertices of T than they would from a 
proportional split it will be possible to embed the remainder of T. 

2. Definitions. 

2.1. Notation. For a graph G, we shall write V(G) for the vertex set of G, and \G\ for the 
number of vertices of G. E(G) denotes the set of edges of G, and e(G) := \E(G)\. Similarly 
for sets X, Y C V(G), e(X, Y) denotes the number of edges between X and Y. We shall 
sometimes write v £ G to mean v £ V{G). The degree of a vertex v £ G, denoted d(v), is 
the number of edges e 6 E{G) incident to v. We denote the minimum and maximum degree 
(taken over all vertices of G) by S(G), and A(G) respectively. The distance d(u,v) between 
vertices u, v £ G is the length of the shortest path connecting u and v. 

A tree is a connected graph which does not contain any cycles. We will often use the fact 
that for any subtree T' of a tree T and any vertex x £ T there is a unique vertex y £ T' 
which minimises d(x,y) over all y £ T' . For any vertex x £ T and edge e £ E(T) incident 
to x, the weight of e from x, denoted w e (x), is the number of vertices y ^ x of T for which e 
is the first edge of the path from x to y. Each vertex y ^ x of T contributes to the weight 
from x of precisely one edge incident to x, so the sum of the weights from x over all edges 
incident to x is \T\ — 1. Also, if xy is an edge of T, then w e (x) + w e (y) = \T\. 

A rooted tree is a tree with a specified vertex r as a root. In a rooted tree every vertex x 
other than the root has a parent] this is defined to be the unique neighbour y of x with 
d(y, r) < d(x, r). If y is the parent of x then we say that x is a child of y. A leaf in a tree is a 
vertex of degree one; so every vertex other than the root is a child of some vertex, and every 
vertex apart from a leaf is a parent of some vertex. An ancestral ordering of the vertices of a 
tree is a linear order in which the root appears first and every other vertex appears after its 
parent. 

A directed graph G = (V, E), or digraph, is formed by a vertex set V and a set of edges E, 
where every edge e £ E is an ordered pair (u, v) of vertices of G. For u, v £ V we write u — > v 
or v <— u if (u, v) £ E(G). Also, for any vertex v of G, N + (v) denotes the set of vertices u such 
that (v,u) £ E(G), and N~(v) denotes the set of vertices u such that (u, v) £ E(G). d + (v) 
and d~(v) denote |A r+ (v)| and \N~(v)\ respectively, and 5 + (G) and 5~(G) are then defined 
to be the minimum of d + (v) and d~(v) respectively over all vertices v £ G. The minimum 
semidegree is 5°(G) = min{<5 + (G), 5~ (G)}. A tournament on n vertices is a digraph G on n 
vertices in which for any distinct u,v £ V(G) precisely one of u — > v and u v holds. So a 
tournament can be thought of as an orientation of the complete graph on n vertices. Given 
a digraph G, the underlying graph G un d er is the graph on V(G) in which there is an edge 
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between u and v if and only if either u — > v or u <— v. We define the distance d(u, v) between 
distinct vertices u and v of a digraph G to be the distance between those two vertices in the 
underlying graph G un der- Also, if G is a graph or digraph, and H is a subgraph of G, then we 
write G — H to denote G[V(G) \ V(H)], that is, the subgraph of G induced on those vertices 
not in H. 

A directed tree is a digraph T for which the underlying graph T un( j er is a tree and in which 
at most one of x — > y and x <— y holds for any pair of vertices x and y of T. We use the 
notation x — > y to distinguish a directed edge from an undirected edge, for which we use the 
notation xy. Given a specified vertex r as a root, we define parents and children of vertices 
of the directed tree T exactly as in the underlying tree T unc j er . Similarly A(T) = A(T un der), 
and the weight w e {x) of an edge e incident to a vertex x is defined as in T un d er . Also, for each 
vertex v, w + (x) is the sum of w e {x) over all edges e incident to x directed away from x, and 
w~(x) is the sum of w e (x) over all edges e incident to x directed towards x. More generally, 
for a subtree T' of T, w + (T') is the sum of w e (x) over all edges e directed from a vertex x 
of T' to a vertex of T — T", and w~(T') is the sum of w e (x) over all edges e directed from 
a vertex of T — T' to a vertex x of T' . We say that a vertex of a digraph is a sin& vertex if 
it has no outneighbours, and a source vertex if it has no inneighbours. Since a directed tree 
on n vertices has n — 1 edges, any directed tree must contain at least one sink vertex and at 
least one source vertex. 

Throughout the paper we use the notation x <C y to indicate that for any y > there 
exists xq > such that for any < x < xq the subsequent statements hold. Such statements 
with more variables are defined similarly. Also, we will sometimes write 'let x <C y' when y 
has an already fixed positive value; by this we mean that there exists some xq > such that 
for any < x < xo the subsequent statements hold. When we use asymptotics such as o(l) 
we mean that these hold as n — > oo and all the other parameters are fixed. 

2.2. Probabilistic estimates. The next lemma, relating to binomial distributions, will be 
used to show that in the randomised algorithm we use in Section El the cluster to which a 
vertex is allocated is almost independent of the cluster to which a vertex far away is allocated. 
We use B(n,p) to denote the binomial distribution with parameters n and p, that is, the 
number of successes in n independent trials, each of which has probability p of success. So 
E(£>(n,p)) = np. 

Lemma 2.1. Suppose that 1/k <C p, (1 — p), e, that n > k 3 /6, and that X = B(n,p). Then 
for any < r < k — 1, 

¥(X = r mod k) = (1 ±e)/k. 

Proof. For each x G {0, . . . , n} let p x denote F(X = x), so p x = (™)p x {l —p) n ~ x - Let fx = np, 
so E(X) = fx, and let p^ = max{pi„i ,P[y|}, so p x < p^ for any x. Moreover, if x < y < fi or 
/i < y < x then p x < p y . So for any r, i G [k], 

F(X = r mod k) = p x + + 

0<x<fi—k (j,— k<x<(j,+k fj,+k<x<n 

x=r mod fe x=r mod A; o;=r mod k 

< Px+i+2pf,+ ^2 Px-k+i 

0<x<fi—k n+k<x<n 
x=r mod k x=r mod k 

< F(X = r + i mod k) + 2p^. 

So P(X = r mod fc) = 1/A; ± 2p^ = (I ±e)/k for any r 6 [&], using a standard result (e.g. 
[3], Section 1.2) on the binomial distribution which states that p^ = 0(n -1 / 2 ) = 0(k~ 3 / 2 ). 

□ 
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The following two results give useful tail estimates for random variables. The first is an 
Azuma-type inequality which bounds the sum of many small and almost independent random 
variables. This is derived in [20] from a result in [IT]. (|20j uses a random walk to embed 
trees in sparse undirected graphs.) The second gives standard Chernoff-type bounds for the 
binomial and hypergeometric distributions. The hypergeometric random variable X with 
parameters (n, m, k) is defined as follows. Let TV be a set of size n, and fix a set S C TV" of size 
\S\ = m. Now choose a set T C N of size \T\ = k uniformly at random. Then X = \T f] S\. 
Note that EX = kra/n. 

Lemma 2.2 ( [20] . Proposition 1.1). Let X±, . . . ,X n be random variables taking values in 
[0, 1] such that for each k 6 [n], 

E(Xk | X\, . . . , Xk-\) < aj~. 
Let \x > Y17=i ai - Then for any < 6 < 1, 



E 

i=i 



Xi > (1 + o» < e 



3 



Proposition 2.3 (|11|. Corollary 2.3 and Theorem 2.10). Suppose X has binomial or hyper- 

2 

geometric distribution and < a < 3/2. Then F(\X - EX\ > aEX) < 2e" V EX . 

2.3. Regularity and Robust Outexpanders. To prove Theorem ll.4l we shall make use of a 
directed version of Szemeredi's Regularity lemma. For this, we make the following definitions. 
If G is an undirected bipartite graph with vertex classes X and Y, then the density of G is 
defined as 

e(X,Y) 



d(X,Y) 



\X\\Y\ 



Now, for any e > 0, we say that G is e-regular if for any X' C X and Y' C Y with \X'\ > s\X\ 
and \Y'\ > e\Y\ we have \d(X',Y') - d(X,Y)\ < e. 

Given disjoint vertex sets X and Y in a digraph G, we use G[X — > Y] to denote the edges 
of G directed from X to Y. We say G[X — > Y] is e-regular with density d if the underlying 
bipartite graph of G[X — > Y] is e-regular and has density d. Next we state the degree form 
of the regularity lemma for digraphs. A regularity lemma for digraphs was proven by Alon 
and Shapira [2] . The degree form follows from this in the same way as the undirected version 
(see [2] for a sketch of the latter) . 

Lemma 2.4 (Regularity Lemma for digraphs). For any s,M' there exist M, no such that 
if G is a digraph on n > no vertices and d £ [0, 1], then there exists a partition ofV(G) into 
Vo , . . . , Vjt and a spanning subgraph G' of G such that 

(1) M' < k < M, 

(2) \V 1 < en, 

(3) \Vi\ = ■■■ = \V k \ =:m, 

(4) d~Q,{x) > d~Q(x) — (d + e)n for all vertices x £ V(G), 

(5) d,Q,(x) > dg(x) — (d + e)n for all vertices x £ V(G), 

(6) for all i 6 [fc] £/ie digraph G'\Vj\ is empty, 

(7) /or a// 1 < i, j < k with i ^ j the pair G'[Vi —> Vj] is e-regular and either has density 
or density at least d. 

We refer to V\, . . . , Vp- as clusters. Given a graph G on n vertices, we form the reduced 
digraph R of G with parameters e, d and M' by applying the regularity lemma with these 
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parameters to obtain Vq , ■ ■ ■ , Vk ■ R is then the digraph on vertex set { 1 , . . . , k} , with i — »• j 
an edge precisely when G'[Vi — > Vj] is e-regular with density at least d. 

One particular regular structure will appear frequently in Section [3] and Section [H We say 
that a digraph G is an e-regular d-dense cycle of cluster tournaments if V(G) = V\ U • • • U Vk, 
where the sets Vi are pairwise disjoint and of equal size, and for each i, G[Vi] is a tournament 
and G[Vi — > Vi+i] is e-regular with density at least d (where here and throughout this paper 
addition and subtraction on the indices of clusters is to be taken modulo k). We shall often 
refer to the sets Vi as clusters, as we will obtain them by an application of the regularity 
lemma. 

Now, let Vi,... , Vfc be disjoint sets of m vertices, and let G be a digraph on vertex set 
Vi U • • • U Vfc. Let S be a subset of some cluster Vi. Then we say that S is (c, y)-good if for 
any V/_ 1 C Vi-\ and V{ +1 C V^+i with |V^_i| > cm and |V/ +1 | > cm, S 1 contains at least -)\fm 
vertices which each have at least 7m inneighbours in V-_ 1 and at least 7m outneighbours in 
V/ +l . Our main tool in the use of regularity will be the next lemma, which states that if G is 
a regular and dense cycle of cluster tournaments, then any subset V- of any cluster Vi with 
|V/| > jm/2 contains a (c, 7)-good subset S of size at most \fm. 

Lemma 2.5. Suppose that 1/m <C e >C 7 <C c, d. Let G 6e an e-regular d-dense cycle of 
cluster tournaments on clusters V\, . . . , Vk, each of size m. Then for any i and for any V- C Vi 
of size |V/| = 7m/2 ; there exists a (c,y)-good set S C V/ with \S\ < \fm. 

Proof. Given V- Q Vi of size \ V-\ = jm/2, choose S QV- at random by including each vertex 
of V/ in S with probability l/7-y/m, independently of the outcome for each other vertex. Then 
by Proposition 12.31 with probability 1 — o(l), \S\ < yfm. 

Now, G[V,-i — > V/] and G[V- — > Vi+i] are each (2e/7)-regular with density at least d/2. 
So all but at most 2em/^ vertices Vi—\ £ V%—i have at least jdm/5 outneighbours in V- . Fix 
any such G V^_i. Then G[V/ n A^ + (^j_i) — > Vi + i] is (5e/7ii)-regular with density at least 
d/2. So all but at most hem/^d vertices Vi + \ G Vi+i have at least ^yd 2 m/20 inneighbours in 
V( n iV + (?;.;_i). We therefore conclude that all but at most 7em 2 /yd pairs (vi-ijUi+i) with 
Uj-l G Vj_i, G Vj + i have at least yd 2 m /20 common neighbours in V/. 

By Proposition for each such pair (vi-i, Ui+i) the probability that Vi+i) has fewer 

than d 2 y/m/2h common neighbours in 5* decreases exponentially with m, whilst the number of 
such pairs is quadratic in m. Thus with probability 1 — o(l) our randomly selected S will have 
the property that all but at most Tern 2 /yd pairs {vi-\, Uj+i) with G Vi_i, Uj+i G V^+i 
have at least d 2 y / m/25 common neighbours in 5*. We may therefore fix an outcome of our 
random choice of S such that both of these events of probability 1 — o(l) occur. 

So if |V^_i| > cm and | V^' +1 | > cm, then we know that at least c 2 m 2 /2 pairs 1, f i+i) with 
Vj-i G V^^j, Uj.fi G V^', j have at least d 2 y / m/25 common neighbours s£S. Thus there are at 
least c 2 d 2 m 5//2 /50 triples of such vertices (vj-i, s, Uj+i), so at least c 2 d 2 y / m/100 > jy/rn 
vertices in S must lie in the common neighbourhood of at least c 2 d 2 m 2 /100 such pairs 
(vi-i,Vi + i). (Otherwise there would be fewer than \ V/_ 1 \\V/ +1 \c 2 d 2 y/m/100+\S\c 2 d 2 m 2 /WO < 
c 2 d 2 m?l 2 /50 such triples s, Wj+i).) Each of these vertices therefore has at least c 2 d 2 m/100 > 

7m neighbours in each of V/_ 1 and V- +1 , as required. □ 

We will also make use of the following well known observation, which says that if G is a 
regular and dense cycle of cluster tournaments on clusters V\, . . . , Vfc, and we select subsets 
U\ C V±, . . . ,Uk Q Vk uniformly at random, then with high probability the restriction of G to 
these subsets is also regular and dense. This follows from a lemma of Alon et al. pQ showing 
that e-regularity is equivalent to almost all vertices having the expected degree and almost 
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all pairs of vertices having the expected common neighbourhood size. We include the proof 
for completeness. 

Lemma 2.6. Suppose that 1/m <^.k-^.e<^.e'-^.d and that m 1 / 3 < mf < m. Let G be 
an e-regular d-dense cycle of cluster tournaments on clusters V\, . . . , Vk, each of size m. For 
each is [A;], choose Ui^Vi of size m! uniformly at random, and independently of all other 
choices. Then with probability 1 — o(l), G[U\ U • • • U Uk] is an e 1 -regular d/2-dense cycle of 
cluster tournaments. 

Proof. We need to show that with high probability, G[Ui — > £7i+i] is e'-regular with density 
at least d/2 for each i. So fix some i £ [k], and let d, L > d be the density of G[Vi — > VJ+i]. Also, 
let Bi be the set of vertices v G Vi for which \N + (v) n Vi+\\ 7^ (di db e)m, and let Di be the 
set of pairs v\ / V2 of vertices of V{ for which \N + (vi) n N + (v2) fl 7^ (d 2 ± 3e)m. Then 
since G[V^ — > Vj+i] is e-regular, < 2em. Also, there are at most 2em 2 pairs in Di which 
contain a vertex of Bi, and each w£^ \ Bi lies in at most 2em pairs in .Dj, so \Dj\ < 4em 2 . 
So let B[ = Bi n C/j and similarly let L^ consist of the pairs in Di for which both vertices 
of the pair are in f/j. Then by Proposition 12.31 the probability that either \B[\ > 4em' or 
|-D^| > 8e(m') 2 declines exponentially with m. 

Now, for each of the at most m! vertices v G Ui\Bi, by Proposition 12. 31 the probability that 
|A^ + (?;)nL r i+i| 7^ (di±2e)m' decreases exponentially with m. Also, for each of the at most (™ ) 
pairs vi / v 2 with u l5 v 2 G J7j\ A, the probability that |A^+(?;i)niV + (u2)n[/i + i| / (c^±4e)m ; 
decreases exponentially with m. So with probability 1 — o(l), for each i none of these events 
of exponentially declining probability will hold. 

Fix such an outcome of our random choices. Then for each i at least (1 — 4e)m' vertices 
v G Ui have \N + (v\) fl Ui + \\ = {di ± 2e)m' and at least (™ ) — 8e(m') 2 pairs ^1,^2 G f/j 
have |A^ + (ui) n N + (v2) fl C/i+i] = (c? 2 ± 4e)m'. It then immediately follows from Lemma 3.2 
of pQ that for each i, G[Ui —> Ui+i] is e'-regular (and it is clear that this has density at least 
di/2 > d/2), as desired. □ 

We now turn to the concept of a robust outexpander. Let \i > 0, let G be a digraph on n 
vertices, and let S C V{G). Then the robust fi-outneighbourhood of S, denoted RN^(S), 
is defined to be the set of vertices of G with at least [in inneighbours in S. For constants 

< fj, < v < 1, we say that a digraph G on n vertices is a robust {[i,v)- outexpander if 
\RN+{S)\ >\S\ + un for all S C V(G) with im < |5| < (1 - i/)n. A recent result from [16] 
(which in turn relies on results from |13t [12] ) states that every robust outexpander with linear 
minimum semidegree contains a Hamilton cycle. We shall make use of this to prove the next 
lemma, which states that if a tournament G is a robust outexpander then G contains a regular 
and dense cycle of cluster tournaments which covers almost all of the vertices of G. We will 
use this structure when we embed a tree T in a tournament G which is a robust outexpander. 

Lemma 2.7. Suppose that 1/n < 1/M < 1/M' <e<d< j u<^<?7<l. Let G be a 
tournament on n vertices which is a robust outexpander with 6°(G) > nn. Then G 

contains an e-regular d-dense cycle of cluster tournaments on clusters V\, . . . ,Vk, where 

1 Uf=i Vi\ > (1 - e)n, and M' < k < M. 

Proof. Let R be the reduced digraph of G with parameters e, d and M' obtained by applying 
Lemma El and let k = \R\, so W < k < M. Then by Lemma 12 of [IB], <5°(i?) > n\R\/2, 
and i? is a robust (/u/2, 2i^)-outexpander. Then by Theorem 14 of [16], which states that any 
robust outexpander of linear minimum semidegree contains a Hamilton cycle, we know that 
R contains a Hamilton cycle. Let V\, . . . , V& be the clusters of R in the order of the cycle. 
Then | Uf=i | > ( 1 — e)rt, G[V^] is a tournament for each i and since — > Vi + i is an edge 
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of R for each i, G'[V% — > Vi+i] is e-regular with density at least d. (Here G' is the spanning 
subgraph of G obtained by Lemma 12.41 ) □ 

Of course, we will sometimes need to embed a tree T in a tournament G which is not a 
robust outexpander. The next lemma will be a useful tool in this situation; it states that if 
a tournament G is not a robust outexpander then V(G) can be partitioned into two sets so 
that most edges between the two sets have the same direction. 

Lemma 2.8. Suppose that 1/n <C (J* *C v, that G is a tournament on n vertices and that G 
is not a robust (fi, v)- outexpander. Then we can partition V{G) into sets S and S' such that 
vn < \S\, \S'\ < (1 - v)n and e(G[S -> S'}) < 4/xn 2 . 

Proof. Since G is not a robust (fi, z/)-outexpander there exists S C V(G) such that 
\RN+(S)\ < \S\ + fin and vn < \S\ < (1 - u)n. Choose such an 5, and let S' = V(G) \ S, so 
vn < \S'\ < (1 — v)n also. 

Since G is a tournament, at most 2/m + 1 vertices v G S have d^,g, (v) < fin, and so at 
most 2fin + 1 vertices v G S have v (£ RN+(S). So \RN+(S) \ S\ < 3/m + 1, and so the 
number of edges from S to S' is at most (3/xn + + fm\S'\ < 4/in 2 . □ 

2.4. Basic Tree Properties. In this section, we shall prove several lemmas which we shall 
make use of in proving Theorem 11.41 The first two of these will enable us to split a tree into 
several pieces with properties that will be useful for the analysis of the randomised embedding 
algorithm used in Section [3l 

Lemma 2.9. Let T be a tree on n > 3 vertices. Then there exist subtrees T' and T" of T 
such that T' and T" intersect in precisely one vertex of T , every edge of T lies in precisely 
one ofT' and T" , and e(T'),e(T") > e(T)/3. 

Proof. We begin by showing that T must contain a vertex v such that every edge e incident 
to v has w e {v) < n/2. Recall that if e = uv, then w e {u) + w e {v) = n, and so at most one of 
w e (u) > n/2 and w e (v) > n/2 can hold. Since T contains n vertices and n — 1 edges, by the 
pigeonhole principle T contains a vertex v so that no edge e incident to v has w e (v) > n/2. 

Now, choose such a vertex v in T, and let vi,...,v r be the neighbours of v in T. For 
each i, let Si be the set of vertices x of T such that V{ lies on the path from v to x. Then 
every vertex of T other than v lies in precisely one set Si- Now, for each i, let Tj be the tree 
T[Si U {v}]. Then each Tj is a subtree of T and every edge of T is contained in precisely one 
Tj. So {e(Tj) : i £ [r]} is a set of positive integers, none greater than 2(n — l)/3, which sum 
to n — 1. Thus there exists A C [r] such that the sum of elements of {e(Tj) : i G yl} lies 
between (n - l)/3 and 2(n - l)/3. Then if we take T' = \JieA T i and T " = Ui^A T i then T ' 
and T" satisfy the conditions of the lemma (in particular, T' n T" = {v}). □ 

Lemma 2.10. Suppose that 1/n <C 1/ A,e,l/k. Let T be a tree on n vertices satisfying 
A(T) < A and rooted at t\. Then there exist pairwise disjoint subsets F\, . . . ,F r of V(T), 
and vertices vi,...,v r (not necessarily distinct) of T such that: 

(1) |Ui 6[r ]*il>(l-eH 

(2) \Fi\ < n 2 / 3 for each i. 

(3) For any i £ [r], let x G {t\} U Uj<i Fj, and let y G Fj. T/ien i/ie pat/i /rom x to y inT 
includes the vertex Vi . 

(4) For any y £ Ft we have dT{vi,y) > A; 3 . 
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Proof. We begin by splitting T into a family T of subtrees of T by repeated use of Lemma f2. 9 1 
So initially let T = {T}, and then we repeat the following step. Let Ti arge be the largest 
member of T . Use Lemma 12.91 to split Ti arge into subtrees T' and T" which intersect in 
a single vertex, partition the edges of Ti arge , and satisfy e(T'),e(T") > e(Ti arge )/3. Then 
remove Ti arge from T , and replace it by the two smaller trees T' and T" . After at most 377. 1 / 3 
steps we must have that \T*\ < n 2//3 for every T* £ T . At this point we terminate the process. 

Observe that if T',T" £ T, then T' and T" intersect in at most one vertex. Now, form a 
graph Qjr with vertex set T and with an edge between T' , T" £ T if and only if T' and T" have 
a common vertex. Then Qj is connected, and so contains a spanning tree Tf- Choose To to 
be a member of T containing the root t\ of T, and let To, T\, . . . , T r be an ancestral ordering 
of the members of J- (thought of as vertices of the tree 7f)- Now, for each 1 < i < r let Vi 
be the common vertex of T and its parent in Tf- Then define Pj for each i £ [r] by 

Fi = V{Ti) \ {x £ T : d T (v u x) < k 3 }. 

It remains to show that Pi, . . . , F r and v\, . . . ,v r satisfy the required properties. (4) is imme- 
diate from the definition of Fi, and (2) holds since each Tj contained at most n 2//3 vertices. 
For (1), observe that every vertex of T was contained in at least one of the subtrees T, and 
that in forming the sets Fi, we deleted at most A fc3 vertices from each of the at most 3n x / 3 
sets V(Ti), so in total at most 3n 1//3 A fc < en vertices of T are not contained in any of the 
sets Fi. 

For condition (3), suppose that T[T 2 T^ is a path in Tp, and some vertex v lies in T{ n Tg, 
but v T' 2 . Let v' £T[D T' 2 and let v" £ T^nTg. Then v' ^ v" , as otherwise T[ and Tg would 
have a common vertex other than v. So there is a path from v' to v" in T which does not 
contain v, so T contains a cycle, giving a contradiction. Similarly it follows that for any path 
Ti x . . . Ti i in Tj:-, if and have a common vertex t>, then v lies in each of , . . . , Tj . , and 
so if Tj j _ 1 is the parent of T^ in Tf then v = v^. Now, for any % £ [r], if x £ {ti} U U^Ki-^j 
and y £ Fi, then x G Tj for some < j < i and y £ Ti. Let TjT{ . . . T S 'T be the path from Tj 
to T in Tf, then T^ is the parent of T in Tf- So Tj U T{ U • • • U T' s contains a path Pi 
from x to Uj and T contains a path P2 from Vi to y. But the property we have proved before 
implies that Pi and P2 only intersect in Vi. Thus Pi U P2 is the path in T from x to y, and 
tij G P1UP2, as required. It also follows that the sets Pj are pairwise disjoint. □ 

Recall that in Section [3l we will describe a randomised algorithm for embedding the vertices 
of a tree T in a digraph G. Whenever this algorithm embeds a vertex t of T in G, it will 
reserve a set of vertices of G in which to embed the children of t. No other vertices may be 
embedded in this set until all the children of t have been embedded. For this to work, we need 
to ensure that there will not be too many of these reserved sets at any point. This motivates 
the following definition. If T is a rooted tree on n vertices, then we say that an ancestral 
ordering of the vertices of T is tidy if it has the property that for any initial segment I of 
the order, at most log 2 n vertices in I have a child not in I. The following lemma shows that 
such an order exists for any tree T. 

Lemma 2.11. Let T be a tree on n vertices rooted at some to £ T. Then there exists a tidy 
ancestral ordering of the vertices ofT. 

Proof. We shall prove that for any r, the vertices of any rooted tree T on fewer than 2 r 
vertices can be given an ancestral ordering so that fewer than r vertices in any initial segment X 
have neighbours outside X. Indeed, suppose that this statement is false, and let T rooted at to 
be a counterexample of minimal order, say of order n. Let r be minimal such that n < 2 r . 
Then let T\,...,T S be the components of T — to, ordered in increasing size, and let ti be the 
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neighbour of to m li- We shall think of U as the root of the tree T{. Then |Tj| < 2 r_1 for 
i < s — 1, and T s < 2 r . So since T was a minimal counterexample, we can find an ancestral 
ordering of the vertices of each Tj so that for any i < s—1, any initial segment of the order of 
the vertices of Tj contains fewer than r — 1 vertices with children outside the initial segment, 
and any initial segment of the order of the vertices of T s contains fewer than r vertices with 
children outside the initial segment. Now, we order the vertices of T as follows. Begin with to, 
then add the vertices of T\ in their order. Next, add the vertices of T2 in their order, and 
continue in this fashion. Since the order of the vertices of each Tj was ancestral, this order is 
also ancestral. Also, any initial segment X of this order contains fewer than r vertices with 
children outside T, contradicting the choice of T, and therefore proving the lemma. □ 



3. Embedding trees of bounded maximum degree in a robust outexpander. 

3.1. Introduction. Our aim in this section is the following lemma on embedding trees of 
bounded maximum degree in robust outexpander tournaments. 

Lemma 3.1. Suppose that 1/n <C u <C v <C rj <C a, 1/A, that G is a tournament on (1 + a)n 
vertices which is a robust (/j,,u)- outexpander with 5°(G) > r\n and that T is a directed tree 
on n vertices with A(T) < A. Then G contains a copy ofT. 

The proof of this lemma shows that we could actually put 1/A lower down in the hierarchy, 
but this is how will shall apply this lemma later on. To prove this lemma, we begin by applying 
Lemma 12.71 to find a regular and dense cycle of cluster tournaments in G, containing almost 
all of the vertices of G. We will then use Lemma f3.2l to find a copy of T within this structure. 
This lemma is stated separately, and in a stronger form than necessary, as we shall also make 
use of it in Section HI 

Lemma 3.2. Suppose that 1/n <C 1/k, 1/A <^e<^d<^a<2, and that m = n/k. Let G be 
an e-regular d-dense cycle of cluster tournaments on clusters V%, . . . , Vu of equal size (l+a)m. 
Let v* be a vertex of V\ with at least d 2 m inneighbours in Vu and at least d 2 m outneighbours 
in V%. Finally, let T be a directed tree on n vertices, rooted at t% and with A(T) < A. Then G 
contains a copy of T , where the vertex t\ of T corresponds to the vertex v* of G. 

The main problem in achieving this is to allocate the vertices of T to the clusters V{ in 
such a way that we can then use the e-regularity of each G[Vi — > Vi+i] to embed the vertices 
of T in G. When we say we allocate v to V{ this means that v will be embedded to a vertex 
of Vi, but this embedding has not been fixed yet. We wish to allocate each vertex of T to a 
cluster Vi so that for most edges u — > v of T, if u is allocated to Vi then v is allocated to Vi+%. 
So if u is allocated to a cluster Vi and u — > v then we say that the canonical allocation of v 
is to the cluster Vi+i, whereas if u <— v then we say that the canonical allocation of v is to 
the cluster Vi-\. If we allocate v canonically, then we say that the edge between u and v 
has been allocated canonically. One way of allocating the vertices of T to the clusters Vi 
would be to begin by allocating the root t\ to V\, and then to allocate all remaining vertices 
canonically. However, to successfully embed the vertices of T within the clusters to which they 
are allocated we will need the vertices of T to be approximately evenly distributed amongst 
the k clusters. This method will usually not achieve this, for example if T is an anti-directed 
path. 

To obtain an 'even distribution' for any sufficiently large tree of bounded maximum de- 
gree, we modify the method so that some vertices (selected randomly) are allocated to the 
same cluster as their parent, rather than being allocated canonically. However, having large 
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components of vertices which are allocated to the same cluster may prevent a successful em- 
bedding of these vertices within this cluster, and so we shall also require that such components 
are small. This is the motivation behind the Vertex Allocation Algorithm given in the next 
subsection, which we shall use to allocate the vertices of T. 

3.2. Allocating the vertices of T. We shall use the following random process to allocate 
the vertices of T to the clusters V,. 

Vertex Allocation Algorithm: 

Input: A directed tree T on n vertices, a root vertex t\ € T, and clusters Vi, . . . , Vk- 

Initialisation: Choose an ancestral ordering ti,...,t n of the vertices of T. 

Procedure: At time r = 1, allocate t\ to V\. At time r > 1, we shall allocate t T . Let t a 

be the parent of t T , which must have appeared before t T in the ordering and has therefore 

already been allocated. Then: 

• If d,T(t T ,ti) is odd, then allocate t T canonically. 

• If dT{t T ,h) is even, then allocate t T to the same cluster as t a with probability 1/2, 
and allocate t T canonically with probability 1/2 (where these choices are made inde- 
pendently for each vertex). 

Termination: Terminate when every vertex of T has been processed and therefore allocated 
to some cluster Vj. 

Note that the cluster to which a vertex t is allocated by this algorithm depends only on the 
cluster to which its parent vertex was allocated and the outcome of the random choice when 
embedding t (if d(t,ti) is even). Since these choices were independent, the probability of any 
possible outcome does not depend on which ancestral order of the vertices was chosen in the 
initialisation step. Now, we say that an edge of T is allocated within a cluster if both of its 
endvertices are allocated to the same cluster. Then we say that an allocation of the vertices 
of a directed tree T to clusters V\, . . . , Vk is semi- canonical if 

(i) every edge of T is either allocated canonically or is allocated within a cluster, 

(ii) every edge of T incident to t\ is allocated canonically, and 

(iii) every component of the subgraph of T formed by all edges allocated within a cluster 
contains at most A(T) vertices. 

The next lemma shows that if we allocate the vertices of a directed tree T to clusters 
Vi,...,Vk by applying the Vertex Allocation Algorithm, then the allocation obtained will be 
semi-canonical, and also that if vertices t and t' are far apart in T then the cluster to which t is 
allocated is almost independent of the cluster to which t' is allocated. As a consequence, if T 
is sufficiently large and has bounded maximum degree, each cluster will have approximately 
equally many vertices of T allocated to it. These properties will allow us to embed the vertices 
of such a T into a regular and dense cycle of cluster tournaments G in the next subsection. 

Lemma 3.3. Let T be a directed tree on n vertices rooted at t\. Allocate the vertices ofT to 
clusters Vi,...,Vk by applying the Vertex Allocation Algorithm. Then the following properties 
hold. 

(a) The allocation obtained will be semi-canonical. 

(b) Suppose that 1/k <C 5. Let u and v be vertices of T such that u lies on the path from 
t\ to v, and dx{u,v) > k 3 . Then for any i,j 6 [k], 

¥(v is allocated to Vi \ u is allocated to Vj) = — — . 
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(c) Now suppose also that 1/n <C l/A,l/k <C 5, and that A(T) < A. Then with proba- 
bility 1 — o(l), each of the k clusters Vi has at most (1 + S)m vertices of T allocated 
to it, where m = n/k. 

Proof, (a) The Vertex Allocation Algorithm allocates every vertex either canonically or to 
the same cluster as its parent, so every edge will be allocated canonically or within a cluster. 
Furthermore, a vertex t can only be allocated to the same cluster as its parent if d(t\,t) is 
even, and so each edge incident to t\ is allocated canonically. Finally, since edges allocated 
within a cluster can only be formed when we allocate such that d(ti,t\) is even, any such 
component is a star formed by some tj and some of the children of tj. 

(b) Since the order in which the vertices are allocated is ancestral, at the stage in our 
algorithm when we have just allocated u, no other vertex on the path P(u, v) in T from u 
to v has yet been allocated. So suppose that we have just allocated u to cluster Vj, let I be 
the length of P(u, v), so i > k 3 , and let u = Vq, v\, . . . , V£ = v be the vertices of P(u, v). Then 
let E = {i > 1: d(vi,ti) is even}, so E indicates the vertices with a random element in their 
allocation, and let O = [$\\E, so O indicates the vertices which are allocated deterministically. 
We then split the edges of P(u, v) into four classes: 



TP 

1 canon 


= {Vi- 


-1 - 


-> Vi 


:i£0} 


-^canon 


= {Vi- 


-1 « 


- Vi 


:i£0} 


-^random 


= {Vi- 


-1 - 


->■ Vi 


:i£E} 


-^random 


= {Vi- 


-1 * 


- Vi 


:i€E} 



Then every edge of P(u, v) lies in one of these 4 sets, and so l-Fcanon \ + \B canon -^random + 
I -6 ran dom I = Furthermore, each edge in -^canon will be allocated canonically, and hence from 
some Vi to Vi + \. Similarly, edges in -B C anon will be allocated from some Vi to V^_i. Meanwhile, 
cd ges in -F r andom ° r -S ran dom wm be allocated from some Vi to Vi+i or V,-\ respectively with 
probability 1/2, and within some Vi with probability 1/2. So let R be the sum of the number 
of edges from -F ra ndom which are allocated canonically and the number of edges from -B r andom 
which are not allocated canonically. Since the outcome of the random experiment for each 
edge is independent of the outcome for any other edge, R has distribution B(\E\, 1/2). Now, u 
was allocated to cluster Vj, and so v will be allocated to cluster Vi, where 

i = j "I - l^canonl |-^canon| R |-^random| mod k. 

But since \E\ > > k s /3, Lemma |2 . 1 1 applied with X = R and n = \E\ implies that for 

any r € [k], the probability that i = r is , as desired. 

(c) Use Lemma ETTOl to choose pairwise disjoint subsets F\, F%, . . . , F r of V(T) and vertices 
V\,...,v r G V(T) such that | (JieH — 0- ~ 3/2k)n and \Fi\ < n 2 / 3 for each i, also such 
that if j < i, then any path from t\ or any vertex of Fj to any vertex of F^ passes through 
the vertex Vi, and finally such that d(vi,Fi) > fc 3 . We shall prove that (f) with probability 
1 — o(l), the total number of vertices from any of the sets Fi allocated to cluster Vj is at most 
(1 + 5/2)m. This will prove the lemma, as the number of vertices of T not contained in any 
of the sets Fi is at most 6m/2, and so in total at most (1 + 5)m vertices of T are allocated to 
any cluster Vj. 

To prove (f), define random variables X? for each i E [r], j € [k] by 
j ff of vertices of Fi allocated to cluster Vj 

so that each Xj lies in the range [0,1]. Then since the cluster to which a vertex t of T 
is allocated is dependent only on the cluster to which the parent of t is allocated and on 
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the outcome of the random choice made when allocating t, E(.X| | X J i _ 1 , . . . ,X^,Vi E V s ) = 

E(X^ | Vi E V s ) for all s E [k]. Here we write Vi E V s to denote the event that vi is allocated 
to V s . So for any i and j, 

E(X? \X{_ V ..., X{) < maxE(Xf | X{_ v . . . ,x(, Vi E V s ) = maxE(X/ | Vi E V s ) 

Z^nxtVjl^tVs) ^ (1 + 5/4)1^1 
= max ■ -7- < 



ae[k] n 2 / 3 kn 2 / 3 

using (b). So, by Lemma 12.21 with probability 1 — o(l) we have that for each j, 

V X j (l + S/2)m 

Z-> i — „2/3 



n 2/3 



and so for each j, the total number of vertices from any of the sets Fi allocated to cluster Vj 
is at most (1 + 5/2)m, proving (f). □ 



3.3. Embedding the vertices of T. Suppose that we have applied the Vertex Allocation 
Algorithm to find an approximately uniform allocation of the vertices of T to the clusters 
of G. We now wish to embed T in G so that each vertex is embedded in the cluster to which 
it is allocated. In principle we could use the blow-up lemma for this. However numerous 
complications arise, for instance because we embed some edges within clusters and because 
we allow A to be comparatively large in Section HI Instead, we embed the vertices of T as 
follows. 

Firstly, to deal with the problem of edges which are allocated within a cluster, we shall 
embed components of T formed by such edges at the same time, using Theorem II. 21 (it would 
also be easy to do this directly). To do this we make the following definition. Let T be a tree 
on n vertices with root t%, and let the vertices of T be allocated to clusters Vi, . . . , Vk by a 
semi-canonical allocation. Then the canonical tree T canon of T is formed by contracting to a 
single vertex each component of the subgraph of T formed of edges which are allocated within 
a cluster. Since the allocation is semi-canonical, each such component contains at most A 
vertices - we say that these vertices correspond to that contracted vertex in T canon . Note also 
that no edge incident to t\ is contracted; we let the root of T canon be the vertex corresponding 
to t\. We shall proceed through all of the vertices of T canon in turn using a tidy ancestral 
order, and at time r we will embed all of the vertices of T which correspond to the vertex r 
of T canon in one step. 

Secondly, we must ensure that at each time r it is possible to carry out this embedding. 
To do this, each time we embed a vertex t E T to a vertex v E G, we will use Lemma 12.51 to 
select sets Af and A~j~ of outneighbours and inneighbours of v in the clusters succeeding and 
preceding that of v, each of size at most 2y/m, which are reserved until all of the children of 
t have been embedded. Indeed, while these sets are reserved, no vertices may be embedded 
in them other than 

(i) the children of t, and 

(ii) those vertices of T which correspond to the same vertex of T canon as a child of t. 

We shall refer to these vertices as the canonical children of t; observe that there are at most A 2 
such vertices. Since we are proceeding through the vertices of T canon in a tidy ancestral order, 
this means that at any time r not too many such sets will be reserved, and so only a small 
proportion of the vertices of any cluster will be reserved. When we later come to embed a 
child t' of t for which the edge tt' was allocated canonically, we embed t' in Af (if t t') or 
Aj (if t <— t') in such a way that we can choose Af, and A^, as desired. 
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When reading the next algorithm, one should bear in mind that often it is not apparent 
that a choice can be made as required by the algorithm. Indeed, if such a choice is not 
possible then the algorithm terminates with failure. Lemma 13.41 will show that under certain 
conditions on G, it will always be possible to make such choices, and so we can be sure that 
the algorithm will succeed. 

Vertex Embedding Algorithm 

Input: 

• A tree T rooted at t\. 

• A constant a and a positive integer m. 

• A digraph G on vertex set V = V\ U • • ■ U Vk, where each Vj has size (1 + a)m, and 
a semi-canonical allocation of the vertices of T to the clusters Vj, with t\ allocated 
to V\. 

• Finally, a vertex v* G Vi to which t\ should be embedded, and constants c and 7. 
Initialisation: Form the canonical tree T canon of T as explained above, and choose a tidy 

ancestral ordering 1, 2, . . . , n' of the vertices of T canon . Let t\, . . . , t n be a corresponding order 
of the vertices of T (so if tj G T corresponds to i £ T canon and tj G T corresponds to j G T canon 
then tj appears before tj if and only if i < j.) 

Procedure: At time r we shall embed the vertices t r , . . . ,t r + s _ 1 of T corresponding to 
vertex r of T canon . Each vertex tj will be embedded to a vertex m of G, where x>\ = v* . Then, 
for each tj we will reserve sets Af. and A~[. of vertices of G for the canonical children of tj . To 
do this, at each time r with 1 < r < n', take the following steps. 

(1) We say that a vertex tj of T is open at time r if tj has been embedded but some child 
of tj has not yet been embedded. Define the set B T of vertices of G unavailable for 
use at time r to consist of the vertices already occupied and the sets reserved for the 
canonical children of open vertices, so 

ti : ti is open 

For each cluster Vj, let Vj = Vj \ B T ', so VJ is the set of available vertices of Vj. 

(2) If r = 1 embed t\ to v±. Alternatively, if r > 1: 

(2.1) Precisely one of the vertices t r , . . . , t r + s _i of T corresponding to vertex r of T canon 
has a parent already embedded; we may assume this vertex is t r . Let t p be the 
already-embedded parent (so p < r, and when t p was embedded sets ^4+ and Aj p 
were chosen). Let Vj be the cluster to which t p is embedded. 

(2.2) If t p — > t r , choose a set S of 3s vertices of Af p C Vj+i such that for each v G S 

\N + (v) n Vj +2 \ > jm and \N~(v) C\Vf\ > 7m. 
If t p -<— t r , choose a set 5 of 3s vertices of C V^_i so for each v £ S 
\N + (v) HV[\ > 7m and \N~(v) D V[_ 2 \ > 7m. 

(2.3) Then choose a copy of T[t r , . . . , t r+s _i] in G[S], and embed each vertex tj to the 
corresponding vertex Vi in this copy. 

(3) In step (2), we embedded each of t r , . . . , t r + s _i in the same cluster; let V q be this 
cluster. For each r<i<r + s — 1, choose sets 

A+ C N+(vi) n V; +1 and ^ C N~ (vi) n ^Lj 

such that the sets and are all pairwise disjoint, each Af. and each is 
(c, 7)-good, and \Af.\, \ Aj~. \ < for each i. 
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Whenever there are several choices (for example if there are several possibilities for S in 
(2.2)), take the lexicographically first of these. This ensures that for each input, the output 
is uniquely defined (i.e. we can view the algorithm as being deterministic). 

Termination: If at any point it is not possible to make the choice required, terminate with 
failure. Otherwise, terminate after every vertex of T^non has been processed, at which point 
ip(U) = Vi for each U € T is an embedding ip of T into G, by construction. 

Lemma 3.4. Suppose that l/n<^.l/A,l/k<^.e<^. r y<^.c<^.d<^.a<2, and let m = n/k. 

(1) Let T be a directed tree on at most n vertices with root ti and A(T) < A. 

(2) Let G be an e-regular d-dense cycle of cluster tournaments on clusters Vi, . . . , V&, each 
of size (1 + a)m, and let v* £ V\ have at least 7m inneighbours in V^ and at least 7m 
outneighbours in V%. 

(3) Let the vertices ofT be allocated to the clusters Vi, . . . , V& so that at most (1 + a/2)m 
vertices are allocated to any one cluster Vi, and so that the allocation is semi-canonical. 

Then if we apply the Vertex Embedding Algorithm to T and G with this allocation and con- 
stants c and 7, then it will successfully embed T into G with t\ embedded to v* . 

Proof. The Vertex Embedding Algorithm will only fail if at some point it is not possible to 
make the required choice. So to demonstrate that the algorithm will succeed, it is enough to 
show that it is always possible to make the required choices. 

In the initialisation we are required to choose a tidy ancestral ordering of the vertices of the 
rooted tree T canon ; the existence of such a choice is guaranteed by Lemma \2. Hi Now, consider 
the set of unavailable vertices B T at some time r. Since the Vertex Embedding Algorithm 
embeds each vertex in the cluster to which it was allocated, we know that at most (1 + a/2)m 
vertices of each Vj are already occupied. Furthermore, suppose that vertex ti of T is open 
at time r. Then ti must correspond to a vertex r' < r of T canon , such that r' has a child 
t" > t. Since we are processing the vertices of T canon in a tidy order, there can be at most 
log 2 n' < log 2 n such vertices of T canon . As each vertex of T canon corresponds to at most A 
vertices of T, at most A log 2 n vertices of T are open at time r. Therefore, at any time r, the 
total number of vertices in reserved sets Af. and A~[. is at most 4Ay / mlog 2 n < am/4. So for 
any cluster Vj, at any time r at most (1 + a/2)m + am/4 vertices of Vj are unavailable, and 
so \Vj~\ > am/4. 

We can now demonstrate that it is possible to make the other choices that the algorithm 
asks for. Indeed, in step (2.2), if t p —> t r with t p embedded into Vj, then the algorithm has to 
choose a set S of 3s < 3A vertices of Af p such that each v G S has \N + (v) n VJ +2 \ > 7m and 
|iV~(i;)n Vj~\ > 7m. But when A^ was chosen at an earlier time r', it was chosen to be (0,7)- 
good. Since the vertex v p to which t p was embedded is in cluster Vj, Af p C Vj+i. Moreover, 
since \V?~\ > am/4 > (1 + a)cm and |V/^ 2 | > am/4 > (1 + a)cm, Af p must contain at least 
jy/m vertices v such that \N + (v) n V-~ +2 \ ^ l m an d \N~(v) n VT\ > 7m. Furthermore, since 
t r is a child of t p , t p has been open since its embedding, and so only canonical children of t p (of 
which there are at most A 2 ) can have been embedded in Af p . So it is indeed possible to select 
such a set S of 3s vertices as required. The argument for the case when t p <— t r is similar. 
As for (2.3), observe that G[S] is a tournament on 3s vertices, and that T[t r , . . . , t r+s _i] is a 
directed tree on s vertices. So by Theorem 11.21 G[S] contains a copy of T[t r , . . . ,t r + a -l], so 
we may choose such a copy. 

Finally we come to step (3). In this step we have just embedded at most A vertices 
t r , . . . ,t r+s -\ in some cluster V q , and we wish to choose sets Af. and A^ for each such 
vertex ti. When embedding these vertices we ensured that for each i the vertex Vi to which ti 
was embedded satisfied | N + (vi)(l VZ +1 \ > 7m (for r = 1 this holds instead by the condition on 
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the outneighbours of v* = v\ ) . So suppose we have chosen Aj r , AJ r+i , . . . , A^ _ x and we now 
wish to choose Af { . Then the previously chosen Af. contain at most 2Ay / w vertices between 
them, and so at least ^raj\ > (1 + a)^m/2 vertices of N + (vi) n VJ~ +1 have not been used in 
these previous sets. So by Lemma [231 we may choose a (c, 7)-good set Af ( C N + (vi) n VJ +1 
of size at most 2y/m which is disjoint from all of the previously chosen Af_. Do this for each 
vertex ti in turn; the choice of the sets A^_ is similar. □ 

We can now give the proof of the main lemmas of this section, beginning with the proof of 
Lemma 13.21 

Proof of Lemma 13.21 Apply the Vertex Allocation Algorithm to allocate the vertices of T 
to the clusters V\, . . . , V&. Then by Lemma 13.3( a) this allocation is semi-canonical, and by 
Lemma 13.3( c) at most (1 + a/2)m vertices are allocated to each of the k clusters Vi. Next, 
apply the Vertex Embedding Algorithm to T and G, giving this allocation as input. By 
Lemma |3,4[ this will successfully embed T in G with t\ embedded to v* . □ 



Proof of Lemma 13.11 If a > 2 then G contains a copy of T by Theorem 11.21 So we may 

assume that a < 2. We begin by introducing new constants 1/n <C 1/M <C 1/M' <C £ <C e' <C 
d <C /U. Then by Lemma 12.71 G contains an e-regular d-dense cycle of cluster tournaments G' 
on clusters Vi, . . . , V k , where W < k < M, and |Vi| = • • • = \V k \ > (1 - e)(l + a)n/k > 
(1 + a/2)n/k. For each i choose V- Q Vi of size \V-\ = (1 + a/2)n/k uniformly at random. 
By Lemma 12.61 we may fix an outcome of these choices so that G" = G' [V[ U • ■ • U V fc '] is a 
e'-regular d/2-dense cycle of cluster tournaments. So by Lemma [3.21 G" contains a copy of T, 
so G contains T also. □ 

We finish this section with an analogous result to Lemma [3.2l for small trees (i.e. the result 
does not demand that \T\ is large compared to |G|). 

Lemma 3.5. Suppose that 1/m <C 1/k, 1/A<^e<^d<^a<2, and that l/k >C 5. Let G be 
an e-regular d-dense cycle of cluster tournaments on clusters Vi, . . . , V k , each of size (l+a)m, 
and let v* G Vi have at least d 2 m inneighbours in V k and at least d 2 m outneighbours in V2. 
Let T be a directed tree on at most m vertices, rooted at t\ and with A(T) < A, and let T' ar 
be the subgraph of T induced by the vertices x E T with d{t\,x) > k s . Let Qt denote the set 
of copies of T in G for which the vertex t\ of T corresponds to vertex v* of G. Then Qt is 
non-empty. Furthermore, there exists a probability distribution on Qt such that if a member 
of Qt is selected at random according to this distribution, then for each i, 

E(# vertices ofT far embedded in Vi) < G-LjUZL [ . 

k 

The probability distribution will actually be, for each member of Qt, the probability that 
applying first the Vertex Allocation Algorithm and then the Vertex Embedding Algorithm 
gives this copy of T in G (recall that actually the Vertex Embedding Algorithm is purely 
deterministic). 

Proof. Apply the Vertex Allocation Algorithm to allocate the vertices of T to the clusters Vi. 
Since |T| < m, at most m vertices can be allocated to any cluster, and the allocation obtained 
is semi-canonical by Lemma EOl fa). Next, introduce constants £ < 7 < c < d, and apply 
the Vertex Embedding Algorithm to embed T in G. By Lemma 13.41 this will successfully 
embed T in G, with t\ embedded to v* , and every vertex of T embedded in the cluster to 
which it was allocated. So it remains only to show that for each i, the expected number of 
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vertices of T far allocated to V% is at most (1 + 5)\T ia,r \/k. But since for any x G T far we have 
d(x,t\) > k 3 , by Lemma IBTBT b) applied with u = t±, 

F(x is allocated to Vi) = - ^ — 
for each i, and the result follows. □ 



4. Embedding trees of unbounded maximum degree in a robust outexpander. 

4.1. Section outline. Having proved the desired result for trees of bounded maximum de- 
gree, we now move onto proving a similar result for trees with no such bound, with a constant 
of 2 rather than 1 in the condition on the order of G. This is the following lemma. 

Lemma 4.1. Suppose that 1/n <C u -C v <C tj <C a, that G is a tournament on 2(1 + a)n 
vertices which is a robust (u,v)- outexpander with 5°(G) > r\n and that T is a directed tree 
on n vertices. Then G contains a copy of T . 

To prove this, we shall begin with a definition. In Section l4~2l we shall define the core tree T c 
of a tree T. This is a subtree of T which has bounded maximum degree, and the property that 
all components of T — T c are small. Then in Section \4. 41 we will show that T c can be extended 
to an 'extended tree' T ex t which also has bounded maximum degree, and also has the property 
that few vertices of T ext have neighbours outside T ext . We will embed the extended tree T ext 
by a similar method to that of the previous section. We will need to do this so that the small 
number of vertices of T ext with neighbours outside T ext are embedded to vertices of G with 
large in- and outdegree in G. In Section 14.51 we will use our results from Section [3] to prove 
Lemma 14.61 on embedding trees of bounded maximum degree. This is similar to Lemma 13.21 
but allows us also to demand that a small subset H C V(T) of the vertices of T, satisfying 
certain conditions, should be embedded in a small subset U of the vertices of G. This will 
allow us to embed T ex t in G in the desired manner. Finally, in Section 14.61 we will complete 
the proof of Lemma [4. II by first using Lemma [4.6l to embed T ex t into G as described and then 
embedding each component of T — T ext in the unoccupied vertices of G. 

4.2. The core tree. Let T be a tree on n vertices, and let A > 2 be fixed. Then we say 
that a vertex v of T is A-core if every edge incident to v has w e (v) < (1 — 1/A)n. We call 
the subgraph of T induced by A-core vertices of T the core tree of T with parameter A, and 
denote it by T c (the value of A will always be clear from the context). With this definition, 
for any tree T, the core tree of T with parameter A is the same as the A-heart of T considered 
by Thomason and Haggkvist in [6]. The statements of the next proposition are also noted in 
Section 3 of [6], but we include the proof for completeness. 

Proposition 4.2. Let T be a tree on n vertices, let A > 2 and let T c be the core tree of T 
with parameter A. Then: 

(i) T c is a tree containing at least one vertex. 

(ii) w e (x) > n/A if e = xy is an edge of T c . 

(iii) A(T C ) < A. 

(iv) Every component subtree T' of T — T c has \T'\ < n/A. 

Proof. For (i), note that since A > 2, for any edge e = uv of T at most one of w e (u) > 
(1 — 1/A)n and w e (v) > (1 — 1/A)n holds. Since T has more vertices than edges, there must 
therefore be some vertex v G T such that w e (v) < (1 — 1/A)n for every edge e incident to v, 
and so v G T c . It remains to show that T c is connected. Observe that if u, v, w are distinct 
vertices of T such that there is an edge between u and v and an edge between v and w, then 
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w uv (u) > w vw (v). Now, suppose x, y G T c , and let x = vi, V2, ■ ■ ■ , v r = y be the vertices of the 
path from x to y in T (in order). Suppose for a contradiction that some Vi is not in T c . Then 
for some neighbour z of v^, w ViZ {vi) > (1 — 1/A)n. If 2 ^ Vj+i, then for each i < j < r — 1 we 
have io v -t;- +1 (7;j+i) > (1 — 1/A)n, and so y ^ T c , giving a contradiction. On the other hand, 
if z = Vi+i, then for each 2 < j < i, w Vj _ lVj (vj-i) > (1 — 1/A)n, and so x ^ T c , again giving 
a contradiction. 

Now, (ii) is immediate from the fact that if e = xy is an edge of T then w e (x) + w e {y) = n. 
Then (hi) follows directly from (ii), as the sum of w e (v) over all edges incident to v is n — 1. 

Finally, for (iv), observe that for any such T' there is u G T', i> G T c with e = w an edge 
of T. Suppose that |T'| > n/A. Then w e (v) > \T'\ > n/A, and so w e (u) < (1 - 1/A)n. 
But since w e >(u) < w e (v) < (1 — 1/A)n for every other edge e' incident to u, this means that 
it G T c , giving a contradiction. □ 

Note that T c is an undirected tree obtained from an undirected tree T. However we will 
often refer to the core tree of a directed tree T; this means the directed tree formed by taking 
the core tree T c of the underlying graph T un d er (an undirected tree) and directing each edge 
of T c as it is directed in T. 

The idea behind this definition is that the core tree is a bounded degree tree. The general 
technique we shall use to work with a tree T of unbounded maximum degree (in both this 
and later sections) is to hrst consider the core tree T c , and then consider separately each 
component of T — T c , making use of the fact that each such component is small. 

4.3. Leading paths. Let T be a tree on n vertices, rooted at ti, let H C V(T), and let A; be a 
positive integer. For any vertex x G T, there is a unique path in T from x to t±; let P x denote 
the set of the first k vertices of this path, starting from x. Let H 1 = [j x( z H P x , and then for 
each i > 1 let H l+1 be formed from H % by adding the vertices of P x for any x G H l with at 
least two children in H l . After at most n steps we must have H l = H t+l , when we terminate 
the process. We refer to this final H l as H with leading paths included, denoted Vk{H). So 
H C V k {H) C1/(T). Note that V k {H) depends on both the value of k and the root t r of T. 

Next we shall prove two results which will enable us to make use of this definition. The 
first shows that if H is small then Vk{H) is small, and the second shows that if H is small 
then it is possible to embed any component T' of T[Pk(H)] in a regular and dense cycle of 
cluster tournaments such that the vertices of V(T') n H are embedded in the first cluster and 
the 'root' of H is embedded in a given cluster. 

Proposition 4.3. Let k be any positive integer, let T be a tree on n vertices, rooted at some 
ti G T, and let H C V(T). Then \V k {H)\ < 3k\H\. 

Proof. Consider any component T 1 of T[Pk(H)], and let t'^ be the unique vertex of T' with 
minimal d(t\, t'i). Then every vertex of T' lies on the path from some vertex of H to t±, and 
so T' is precisely the set of vertices in paths between t[ and vertices of H n V(T'). Thus 
only t^ and vertices of H can be leaves of T'. It follows that T[P k {H)\ has at most 2\H\ 
leaves. Since T[Vk(H)] is a forest, it follows that the number of vertices of T[Vk(H)] with at 
least two children in T\P k (H)] is also at most 2\H\. Furthermore, any vertex x G T for which 
the vertices of P x were added to Vk{H) at any stage is either a member of H or has at least 
two children in Vk{H). This is true for at most 3\H\ vertices x, and for each such vertex at 
most k vertices were added. □ 

Lemma 4.4. Suppose that 1/m <ti 1/k ^ e <S d. Let T be a directed tree rooted at some 
ti G T. Let H C V(T) be of size \H\ < m/10k, let T be a component of T[V k (H)] which 
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does not contain t\, and let t\ be the unique vertex of T' with minimal d(t' l7 ti). Let G be an 
e-regular d-dense cycle of cluster tournaments on clusters Vx, . . . , Vk, each of size m. Then 
for any j G [k], G contains a copy ofT' with the vertex t\ corresponding to some vertex ofVj, 
and every vertex in V(T') n H corresponding to some vertex ofV±. 

Proof. Informally, from the perspective of t^, T' begins with a path of length k — 1 (from t± 
to t, say) before possibly branching out. So we shall find a copy of T 1 in G by first embedding 
the vertices of this path so that t^ is embedded in Vj and t is embedded in V\. We then embed 
all of the remaining vertices of T 1 in V\. 

More formally, note that for each < s < k — 1 there is precisely one vertex x s of T 1 with 
d(t' l7 x s ) = s (so xo = and Xi ^ H for any i < k — 1). Let F C [k — 1] be the set of 
those s such that x s _i — > x s , and let B C [k — 1] be the set of s such that x s _i <— x s . Then 
\F\ + \B\ = k — 1, so either \F\ > k — j or \B\ > j — 1. Suppose first that \B\ > j — 1. Then 
choose B' C B of size j — 1 . We shall allocate the vertices of T' to the clusters V\ , . . . , Vj . 
Begin by allocating xo to Vj. Then for each s G [k — 1] in turn, let V, be the cluster to 
which was allocated, and allocate x s to VJ if s ^ -B', or to VJ_i if s G B' . Then since 
| B' | = j — 1, will be assigned to V\. Finally, allocate all other vertices of T' to V\. Then 
every edge of T' is allocated either canonically or within a cluster. 

Next we shall embed T' in G so that every vertex is embedded within the cluster to which 
it is allocated. To begin, by a standard regularity argument we may choose for each i a set 
V- C Vi so that | V(\ > 9m/10 and every vertex v £ V- has at least dm/2 outneighbours in V( +l . 
Let G' = G[V(U- ■ - UVl]. Now, for each i, let S{ be the set of vertices of T" allocated to VJ. So 
l^l,...,!^! < k-1 and |5i| < |T'|. Then by Proposition EM 3|5i| < 3|T'| < 9A;|iT| < 
So by Theorem 11.21 we may embed T"[Si] in GfV/]. Now suppose that we have successfully 
embedded T'[Si U • • • U S^-i] in G[V{ U • • • U V(_^\ for some i < j. Then precisely one vertex 
t £ Si has a neighbour G Si-i, and t' has already been embedded to some v' G V^'_i- Now 
has at least dm/2 > 3\S{\ outneighbours in V-, and so by Theorem 11.21 we may embed T'[Si] 
among these outneighbours. Let v be the vertex to which t is embedded; then since v is an 
outneighbour of v' ', we have extended our embedding to an embedding of T'[Si U • • • U Si] 
in G[V{ U • • • U V-]. Continuing in this manner we obtain an embedding of T' in G, with t^ 
embedded in Vj and V(T') \ {xo, • • • ,2:^-2} 5 V(T') n H embedded into V±, as desired. A 
similar argument will achieve this if \F\ > k — j. □ 

4.4. The extended tree. The next lemma combines the ideas of the core tree and leading 
paths to give the structure within a tree T which we shall use to prove Lemma 14.11 It shows 
that given a tree T we may extend the core tree T c of T with parameter A to an 'extended 
tree' T ext which, like T c , has bounded maximum degree (although this bound is now much 
larger than A). T ext will also have the property that only a small subset H of the vertices of 
T cxt have neighbours outside T ext , and that few vertices of T ext are close to a vertex of Vk{H). 

Lemma 4.5. Suppose that 1/n, 1/A* -C 1/A, l/k,oj >C 1. Let T be a tree on n vertices, and 
let T c be the core tree of T with parameter A. Choose any vertex t\ G T c as the root of T . 
Then there exists a subtree T ex t of T and a subset H C V(T ex t) which satisfy the following 
properties. 

(i) T c C T ext . 
in) A(T e ,t) < A*. 

(iii) For any edge e between V{T — T ex t) and V(T ex t), the endvertex of e in V(T ex t) lies 
in H . 

(iv) The number of vertices v G T ex t which satisfy 1 < d(v,Vk(H)) < A; 3 is at most um. 
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(v) \H\ < n/A fcl/ ". 

Proof. We consider the subgraph T — E(T C ) of T obtained by deleting the edges (but not 
the vertices) of T c from T. Each vertex v G T c lies in a separate component of T — E(T C ); we 
denote the component containing v by T v . Then the trees T c and {T v : v G T c } partition the 
edges of T, and the trees {T v : v G T c } partition the vertices of T. 

We say that a vertex v G T c is i-heavy if |T„| > Aj := A fe \ For any integer i, let i?j 
denote the set of i-heavy vertices in T c . So |iTj| < n/Aj, and so by Proposition 14.31 we have 
l'Pfc(-ffj)! < 3/cn/Aj for each i. We wish to choose a large integer t so that few vertices of T 
lie in trees T v for which u is not in Ht but is close to a member of Vk{Ht)- The next claim 
shows that this is possible. 

Claim. For some natural number 1/uj < t < 3/w we have 

(1) | |J T v \<un. 

vdV{T c )\Ht 
d(v,V k (H t ))<k 3 



Proof of Claim. Observe that for each integer i with 1/oj < i < S/oj, if v G V(T C ) \ Hi_i 
then \T V \ < Aj_i, and so 

i II m i i^/^MAfcS+i* 3A;A fc3+1 A i;i " 1 n 3/fcn 

| (J T„| < |P fc (^)|A fc +1 A,_! < — < — — < wn/3. 



i)6V(T c )\i? < _ 1 
d(v,V k (Hi))<k 3 



Now let 

Bi := |J T w . 

d(i«,7' fc (H l ))<fc 3 

Then the sets Bi are pairwise disjoint subsets of V(T). If the claim is false, then \Bi\ > 2wn/3 
for every integer i with 1/oj < i < 3/co, and so | Ui/ W <i<3/ W Bi\ > n ) giving a contradiction. 

□ 



Fix such a value of t, and let H = H t . We define the extended tree T ext by T ext := 
U UveV(T c )\H Tv Then T ex t is a subtree of T with T c C T ext , so (i) is satisfied. Since 
iJ C V(T C ), we have C V(T ex t) as desired. Also (ii) holds since any vertex n G T ext has at 
most A neighbours in T c and at most At neighbours in the single tree T v with v G T c which 
contains u. So A(T ext ) < A + A 4 < A + A fc3/ " < A*. For (iii), observe that if u T exb , then 
u must lie in some T v with v £ H. But then if u has a neighbour in T ex t this neighbour must 
be v. For (iv), consider any u G T ext satisfying 1 < d(u,Vk(H)) < A; 3 . Since d(u,Vk(H)) > 1 
we know that u ^ Ht, so if it G T c then n is counted in ([T]). If u ^ T c then there exists i; 
such that u £ T v and w G V(T C ) \ H. Note that Vk{H) C V(T C ) (since ti G T c ). This in turn 
implies that d{v,Vk{H)) < d(u,Vk(H)) < k 3 . So it is also counted in ([T]). Finally, for (v), 
recall that \H\ < n/A t < n/A fcl/ ". □ 



4.5. Embedding trees of bounded maximum degree with restrictions. In this section 
we shall prove the following lemma, which is similar to Lemma 13.21 but which allows us to 
restrict some vertices of T to a subset of V(G). 
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Lemma 4.6. Suppose that 1/n <C l/A,l/k <C e <C d <C a, A < 1/2, i/iai m = n/k, that 
X < a/4: and that 5 = d\/8k. Let T be a directed tree on n vertices rooted at t\ and with 
A(T) < A. Let H C V(T) be such that \H\ < 5n/7k and \{x G T: 1 < d(x,V k {H)) < k 3 }\ < 
Sri. Let G be an e-regular d-dense cycle of cluster tournaments on clusters V\, ■ ■ ■ ,T4, each 
of size (1 + a)m, and let U C V% U • • • U Vk have size \U\ > An. Then T can be embedded in G 
so that each vertex t G H is embedded to some u £ U. 

Proof. We may assume without loss of generality that \UD V\\ > Am. If t\ ^ H , then add t\ 
to H, so now we have \H\ < 6n/6k. Moreover, the new Vk(H) is the union of the old Vk{H) 
and {t\}. So now 

(2) \{x G T: 1 < d(x,V k (H)) < A: 3 }| < 5n + A fc3+1 < 35n/2. 

Also, introduce a new constant e' with e < <C i To begin, for each i choose disjoint sets 
X% , Y{ Q Vi such that 

• \Xi\ = (1 + a/2)m and = 3Am/4 < am/4, 

• every vertex of Xi U Y{ has at least d\m/2 inneighbours in Yi-\ and at least d\m/2 
outneighbours in l^+i, and 

• YxQUnVx. 

The existence of such sets can be shown by a standard regularity argument. Indeed, choose 
disjoint sets X'^Y! C such that = (l+a/2+d 2 )m, |y/| = (3A/4+d 2 )m and Y"/ C JJnVi. 
Then both G[Y-_ t -)■ X- U y/] and U -)■ are 2e/A-regular with density at least 
3d/4. So all but at most 9em/\ < d 2 m vertices in X^UY- have at least 9dAm/16 inneighbours 
in Y-_ 1 and at least 9dXm/16 outneighbours in Y- +1 . Delete d 2 m vertices from X[ and d 2 m 
vertices from Y- including these d 2 m vertices of small degree (for each i € [A;]). Then the sets 
Xi and Yi thus obtained from X[ and Y- are as desired. 

Each vertex of Vk{H), and every child of any such vertex, will be embedded in the sets Yi, 
whilst the remaining vertices of T will be embedded in the sets Xj. Observe that by Proposi- 
tion !4.3[ \Vk(H)\ < 3fe|i?| < 5n/2. Moreover, ([2]) implies that there are at most 35n/2 children 
of vertices of Vk(H) outside Vk(H). So at most 25n = d\m/A. vertices will be embedded in 
the sets Y; L . 

Next, let T\, . . . , T r be the component subtrees of T[Pk(H)] and T — Vk{H). So each vertex 
of T lies in precisely one of the Tj. Let T con be the tree obtained by contracting each Tj to 
a single vertex i. We may assume the Tj were labelled so that t\ £ T% and 1, 2, . . . , r is an 
ancestral order of the vertices of T con . Then let 

J = {i : Ti is a component subtree of T[Vk(H)]}, 

L = {i : Ti is a component subtree of T — Vk(H) and |Tj| > \/n}, 

Q = {i : Ti is a component subtree of T — Vk{H) and \T%\ < y/n}. 

Note that each vertex of H lies in some Tj such that i £ J. For each i > 1, Tj contains 
precisely one vertex with a neighbour in some Tj with j < i. (Furthermore, if i G L U Q then 
this j must belong to J.) Let ti be this vertex, then the children of vertices of Vk{H) which 
are not in Vk{H) are precisely the vertices ti for i G LU Q. For each i let T/ ar be the set of 
vertices x G T with d(ti,x) > A; 3 . Then 

(3) ^ |F(T)\T/ ar | <3<5n/2 

by ©. Finally, for each i let Tf = T[F(Ti) U • • • U V(Ti)], so 2V- is the graph formed from 
the union of T\, . . . , Ti by also adding the edges between T\, . . . , Tj. 
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We shall use a randomised algorithm to embed the vertices of T in G. At each time r this 
algorithm will embed the vertices of T T . Indeed, if r G J, we will use Lemma [4.41 to embed T T 
in the sets Yj so that the vertices of H n V(T T ) are embedded in Y\ C U. If r G L, we will 
use Lemma 13.21 to embed T T in the sets Xi (except for the vertex t T , which will be embedded 
in some Yi) so that approximately equally many vertices of T T are embedded in each set Xj. 
Finally, if r G Q we will use Lemma 13.51 to randomly embed T T in the sets X{ (again with the 
exception of the vertex t T , which will be embedded in some Yi) so that the expected number 
of vertices of T^ ar embedded in each set X{ is approximately equal. Together the embeddings 
of each Tj in G will form an embedding of T in G such that every vertex of H is embedded 
in U, as desired. At any time r we will be able to choose the desired embedding of T T unless 
there are insufficient vertices remaining unoccupied in one of the sets Xi. We shall show that 
this is unlikely to happen for any i, and hence that with positive probability the algorithm 
will find a copy of T in G, proving the lemma. 

Tree Embedding Algorithm. 

At time r = 1, we wish to embed T\ in G. Recall that we ensured that t\ G H, so 1 G J. 
We shall embed T\ in Yi. Indeed, |Yi| = 3Am/4, and \T X \ < \V k (H)\ < Sn/2 = dXm/16, 
and so Y\ contains a copy of T\ by Theorem 11.21 Choose (deterministically) such a copy, and 
embed each vertex of T\ to the corresponding vertex in this copy. 

So after completing the first step, the algorithm will have obtained an embedding of T\ = 
Tf in G such that any vertex of H n V(T^) is embedded in Yi, and only vertices of Vk(H) 
and their children have been embedded in the sets Yi. 

At a given time r > 1 we may therefore suppose that the algorithm has found an embedding 
of T^_ 1 in G so that each vertex of HC\V(T^_j) is embedded in Yj, and only vertices of Vk(H) 
and their children have been embedded in the sets Yi. (Recall that this implies that at most 
dXm/A vertices are embedded in the sets Yj.) We wish to extend this embedding to include 
T T , and we do this by the following steps. 

• For each i let XJ and Yj consist of the unoccupied vertices of Xi and Yj respectively. 
If \XJ\ < \T T \/k + am/A for some i, then terminate the algorithm with failure. So 
we may assume that \X[\ > \T T \/k + am/A for each i. Also, since at most dXm/A 
vertices have been embedded in the sets Yj, every vertex of Xi U Yj must have at least 
dXm/A inneighbours in Y[_-^ and at least dXm/A outneighbours in YZ_i- 

• By definition, t T is the unique vertex of T T with a neighbour which has already been 
embedded. Let t' T be this neighbour, and let v' T be the vertex to which t' T was embed- 
ded. Also let Vj be the cluster into which t T should be embedded so that the edge 
between t T and t' T is embedded canonically. Then v' T has at least dXm/A neighbours 
in YJ , and so by a standard regularity argument, we may choose some such neigh- 
bour v T G Yj which has at least adm/8 outneighbours in Xj +1 and at least adm/8 
inneighbours in Xj_ 1 . 

• Now, if r G L, for each i consider a set ZJ C XJ of size (1 + a/8)\T T \/k chosen 
uniformly at random and independently of all other choices. We can do this since 
(l + a/8)|r T |/ife < \T T \/k + am/8 < \X[\ for each i G [k]. Then since G[X{U...Xl\ is 
a (16e/a)-regular d/2-dense cycle of cluster tournaments, by Lemma [2?6l G\Z\. . . . , ZJ/\ 
is an e'-regular ii/4-dense cycle of cluster tournaments with probability 1 — o(l). Also 
with probability 1 — o(l), v T has at least ad\T T \/16k outneighbours in Zj +1 and at 
least ad\T T \/16k inneighbours in Zj_ 1 . So we may choose (deterministically) sets ZJ 
satisfying these two properties. Now delete a single vertex (chosen arbitrarily) from 
ZJ, and replace it by v T , and let G T be the restriction of G to the new Z\, . . . , ZJ.. 
Then G T is a (2e')-regular (d/8)-dense cycle of cluster tournaments with clusters 
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of size (1 + a/8)\T T \/k. So by Lemma 13.21 G T contains a copy of T T with at most 
(1 + a/8)\T T \/k vertices of T T embedded in each X{, and with t T embedded to v T . 
Embed each vertex of T T to the corresponding vertex in this copy. 

• If instead r £ Q, then arbitrarily choose Zj C Xj U {v T } of size am/ 8 with v T £ 
Zj, and for each i ^ j choose ZJ C XJ of size am/ 8 uniformly at random and 
independently of all other choices. Then G T := G[Z±,...,ZT] is a (16e/o!)-regular 
(i/2-dense cycle of cluster tournaments. Also, with probability 1 — o(l), v T has at 
least a 2 dm/128 outneighbours in Zj +1 and at least a 2 dm/128 inneighbours in Zj_ l: 
so we may fix (deterministically) our choices of the ZJ such that this event holds. 
Then by Lemma 13.51 the set of copies of T T in G T such that t T is embedded to v T is 
non-empty, and furthermore there exists a probability distribution on this set so that 
if a copy is chosen according to this distribution, then the expected number of vertices 
of T^ ar embedded in each ZJ is at most (1 + y / F)|T. J l : ar |/A;. Choose (deterministically) 
such a distribution, and choose randomly such a copy according to this distribution. 
Embed each vertex of T T to the corresponding vertex in this copy. 

• Finally, if r £ J, then since v' T has at least dXm/A neighbours in Yj~, we may choose 
sets ZJ C YJ , . . . , ZJ C Y/J , each of size dXm/4, so that every vertex of Zj is a 
neighbour of v' T . Let G T be the restriction of G to the sets ZJ; then G T is a (8e/dX)- 
regular (c?/2)-dense cycle of cluster tournaments. Since \H\ < Sn/6k = dXm/48k, by 
Lemma [4.41 G T contains a copy of T T , with vertex t T embedded in YJ , and with every 
vertex of H n V(T T ) corresponding to a vertex of YJ. Embed each vertex of T T to the 
corresponding vertex in this copy. 

• In either case, we have extended the embedding of T~_^ in G to an embedding of 

in G, such that every vertex of H n V{T^) is embedded in Y\ C 17, and only vertices 
of Vk{H) and their children have been embedded in the sets Y{. 

Since T r - = T, if the algorithm does not terminate with failure then at time r, after 
embedding T r it will have obtained an embedding of T in G so that every vertex of H is 
embedded in U, as desired. At this point the algorithm terminates with success. 

It remains to show that with positive probability this algorithm will not terminate with 
failure before embedding T r . Suppose first that SjeQ I^J'I < am/8. Then for any i £ [k] and 
at any time r, the number of vertices embedded in Xi is at most 

l + a/8 ^ ^ (1 + a/8)(n - |r r |) am / ax |T T | 

and so > \T T \/k + am/ '4. Therefore the algorithm cannot terminate with failure at any 
point. So we may assume that SjeQ l-^il — am /&- 

Let OUT be the set of all possible courses of the algorithm until termination. Since the 
only random choices made by the algorithm are the choices of where to embed the Tj for 
each i £ Q, any possible course of the algorithm C 6 OUT can be uniquely described by the 
embeddings /j of into G for each i £ Q such that the algorithm does not terminate before 
embedding Tj. So we may define a probability space with sample space OUT where for any 
C £ OUT, ¥(C) is defined to be the probability that the algorithm takes course C. So 

F(C) = Y[P(F 1 \F i :i<j,i£Q). 

where Fj denotes the event that fj is the embedding of Tj into G, if Tj is embedded at some 
point during C, and is taken to be true otherwise. 
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Now, we define the random variable Wj in this probability space as follows. For any 
C G OUT,j G Q and i G [k], let 

# of vertices from T? ar embedded in X, 

-7= if Tj is embedded during C, 

I y far I 

— — otherwise. 



wm 



Since |Tj ar | < \Tj\ < yfn for each j G Q, Wj is a well-defined function from OUT to [0, 1], 
and so is a well-defined random variable in our probability space. 

For any j G Q and C a ,Cb G OUT, let C a ~j Cj, if and only if C a and Cb share the same 
course before time j (i.e. they embed T\, . . . , identically) or Tj is not embedded at any 
point in either C a or Cb- Then ~j is an equivalence relation on OUT (since if two courses 
agree up to time j — 1, then at time j either they both terminate with failure or they both 
successfully embed Tj). For any equivalence class C* of ~j other than the class of C for which 
Tj is not embedded, every C G C* shares the same course before time j. So for each C G C* , 
the same probability distribution on the set of copies of Tj will have been chosen at time j, 
and a copy will then have been chosen according to this distribution. So further partition 
C* into C*, . . . , C* by this choice, so courses C,C G C* are in the same C* if and only if Tj is 
embedded identically in C and C. Now 

E(Wj I C*) =J2 E ( W j I c *sMC*s I O, 

s 

but every member of C* embeds Tj identically, so E(W- | C*) is simply the number of vertices 
of Tj ar embedded in in this common embedding, divided by \Jn. Also, P(C* | C*) is the 
probability that this embedding of Tj is chosen when the random choice of the embedding 
of Tj is made. So by our (deterministic) choice of the probability distribution on the copies 
of Tj in G, 

(1 + ^)\T^\ 

(4) E(W? I C*) < - 1 



ky/n 

If instead C* is the class of all C such that Tj is not embedded in C, then E(W? | C*) = 
\Tj T \/k^/n by definition, and so (j4|) holds in this case also. 

Now, for any equivalence class C* other than the class in which Tj is not embedded, the 
embeddings of Tj, . . . , Tj_i are identical amongst the members of C*, and so 

E(W] I C* , Wl : s G Q, s < j) = E(W] I C* ). 

Clearly this equality also holds for the class C* in which Tj is not embedded, and so for any 

i G [k], 

(i + V^)|7t ar l 

E(W* \W l s : s G Q, s < j) < maxE(W* | C* , W l s : s G Q, s < j) < , l! 3 . 

J c J kyjn 

Since EjgQ \Tj\ > am/8, by Lemma 12.21 f° r an y i the probability that 

„ (l + al$)Y li eo\T j \ 

jeQ v 

does not hold decreases exponentially with n. So with probability 1 — o(l), © holds for each 
i G [A;]. 

To finish the proof, we show that if © holds for each i G [k], then the algorithm cannot 
terminate with failure, and will therefore successfully embed T in G as desired. Indeed, the 
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algorithm will only terminate with failure if at some time r we have \X[\ < \T T \/k + am/A for 
some i. But for any i £ [k] and any time r, only vertices from subtrees T s such that s £ LL)Q 
and s < t have been embedded in Xj before time r. So the number of vertices embedded 
in Xi before time r is at most 

{1± ^ B - E E \v(t s )\t!-\ + ^ e wi 

seL\{r} s£Q\{t} s£Q\{t} 

f (l + a/8) ^ ITtL,, a |T T | 

<^^- E |T.|+2fo-_ <(l + I )m-_. 

To see that the second line holds, note that \T T \/k < sjn/k < Sn/2 whenever t £ Q and 
\T T \/k < \Vk(H)\ < 5n/2 whenever r E J. So if ([5]) holds, then at any time r and for any 
i G [A;], |AJ| > |T T |/fc + am/4, and so the algorithm succeeds. This completes the proof of 
Lemma 14.61 □ 



4.6. Proof of Lemma 14.11 We can now give the proof of Lemma 14.11 which will proceed 
as follows. We shall apply Lemma 14.51 to find a subtree T ex t of T and a subset H C V(T ex t). 
Then we shall find a cluster cycle C in G such that |C| is slightly larger than |T ext |. We then 
embed T ext into C using Lemma [4. 61 restricting H to a set ?7 of vertices of C which have many 
inneighbours and outneighbours outside C. Finally we shall use this property of U to embed 
the vertices of T — T ext in V(G) \ V(C) and thereby complete the embedding. 

If a > 1/2, then G contains a copy of T by Theorem 1 1.2 1 So we may assume that a < 1/2. 
We begin by introducing new constants A*, M, M', 8, e, d and A with 

l/n < 1/A* <1/M <C 1/M' <e<(i</^<^<??«; 1/A < a. 

Then Lemma 12.71 implies that G contains an e-regular d-dense cycle of cluster tournaments 
on clusters Vi, . . . , V/., where M' < k < M and each cluster has equal size between (2 + a)m 
and (2 + 2a)m, where m = n/k. Also let 

5 := da 2 /16000/c. 

Remove vertices from each Vi to obtain a 2e-regular <i/2-dense cycle of cluster tournaments 
G' on clusters V[, . . . , V{. each of size (2 + a)m. 

Let T c be the core tree of T with parameter A, and choose any vertex t\ £ T c as the root 
of T. Then by Lemma [431 (applied with to = 5a/ 50), we may choose a subtree T ext of T and 
a subset H C F(T ext ) satisfying the following properties. 

(i) T c C T ext . 

(ii) A(T cxt ) < A*. 

(iii) For any edge e between V(T — T ext ) and V(T ext ), the endvertex of e in V(T ex t) lies 
in H. 

(iv) The number of vertices v £ T ext which satisfy 1 < d(v,Vk(H)) < /c 3 is at most 5an/50. 

(v) \H\ < n/A k50/Sa < 5an/350k. 

Let T x + , . . . , T r + and T x _ , . . . , T~ be the component subtrees of T — T ext . Each and 
will contain precisely one vertex, vf or v~ respectively, with a neighbour in T ext . Label the 

, . . . ,T r + and T-f , . . . , T~ so that each contains vf with an inneighbour in T ext , and 
each contains v~ with an outneighbour in T cxt . By (i) and Proposition I4.2( iv) each Tf~ 
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and each T~ contains at most n/A vertices. Let x = |T ex t|, let y = |T 2 U • • • U T r + | and let 
z = \T^~ U • • • U T~ | , so x + y + z = n. 

Then all but at most 2y + x + an/2 vertices of G have at least y + x/2 + an/4 outneighbours 
in G, and all but at most 2z + x + an/2 vertices of G have at least z + x/2 + an/4 inneighbours 
in G. So at least 2(1 + a)n — 2y — 2z — 2x — an = an vertices of G satisfy both of these 
conditions. Choose any an/8 of these vertices to form Uq. Then \Uq\ = an/8, and each 
v G Uq has at least y + x/2 + an/ 8 outneighbours outside Uq and at least z + x/2 + an/ 8 
inneighbours outside Uq. 

Suppose first that x > an/50. From each cluster V( of G' choose a set X{ of x(l + a/2)/k 
vertices uniformly at random, and let X = X\ U • • • U X^. Then \X\ = x{\ + a/2), and 
for any single vertex u G G' , the probability that u is included in X is equal to x/2n. So 
by Proposition 12.31 with probability 1 — o(l) the set U := X n Uq satisfies \U\ > ax/20 > 
a 2 n/1000. Also, for any vertex v G U, the expected number of outneighbours of v outside X 
is at least 



A similar calculation shows that for each v G U, the expected number of inneighbours of v 
outside X is at least z + an/16. So by Proposition 12.31 we find that with probability 1 — o(l), 
every vertex v G U has at least y + an/ 20 outneighbours outside X and at least z + an/ 20 
inneighbours outside X. Fix a choice of X such that both these events of probability 1 — o(l) 
occur. 

Since every vertex of U has either at least (2(1 + a)n — \X\)/2 > y + z + x/2 + an/2 > 
y + z + an/ 2 inneighbours outside X or at least y + z + an/2 outneighbours outside X, we 
may choose a set U' C U of size 

1*7' I > \U\/2 > a 2 n/2000 such that either 

(a) every v G U' has at least y+ an/ 20 outneighbours outside X and at least y + z + an/20 
inneighbours outside X, or 

(b) every v G U' has at least y + z + an/20 outneighbours outside X and at least z + an/ 20 
inneighbours outside X. 

So G't-X] is a (150e/a)-regular (d/2)-dense cycle of cluster tournaments on clusters X\,..., Xj~ 
each of size (1 + a/2)x/k, and U' C X x U . . . X fc has size \U'\ > a 2 n/2000 > a 2 x/2000. Also 
by (ii), (iv) and (v) we know that T oxt is a directed tree on x vertices rooted at t% and 
with A (T ex t) < A*, and also that H C V(T ex t) satisfies < 5an/350k < 5x/7k and 
|{i G T ext : 1 < d(t,V k (H)) < k 3 }\ < 5an/50 < 5x. So by Lemma 031 (with A* in place 
of A and a 2 /2000 in place of A), we may embed T ext in so that every vertex of H is 

embedded to a vertex of U' . 

Now suppose instead that x < an/50. Then, since every vertex v of G has either d + (v) > 
(1 + a)n — 1 > y + z + an or dT (v) > (1 + a)n — 1 > y + z + an, we can choose a 
set U' C C/q of size |f7'| > an/16 which satisfies either (a) or (b) (with X := U'). Since 
I Text | = x < an/50 < |J7'|/3, and G[U'] is a tournament, by Theorem II. 21 we may embed T ex t 
in Gr[J7'], so in particular every vertex of H is embedded to a vertex of U'. 

In either case, let V ex t be the set of vertices of G to which T ex t is embedded. We may 
now complete the embedding of T in G. If U' satisfies (a), then we first proceed through the 
trees in turn. For each T^ + , let uf be the inneighbour of vf in T ext (so uf G H by (in)). 
Then uf has been embedded to some vertex v G U' . This v G U' has at least y + an/20 
outneighbours outside V ext , of which at most y have been used for embedding the trees T+ 




/ i \ an 
+ V ~ 2n~) ~8~ 
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for j < i. So there are at least an/20 outneighbours of v outside V ex t available to embed T { , 
and so since < n/A < an/60, by Theorem O we can embed among these vertices. 
In this way we may embed each of the . We then proceed through the similarly. For 
each T~ let be the inneighbour of v~ in T ext (so G H by (hi)). Then u~ has been 
embedded to some vertex v G U' . This v G U' has at least y + z + an/20 inneighbours outside 
V ex t , of which at most y + z have been used for embedding the trees T± , . . . , T+ and the trees 
TJ for j < i. So there are at least an/20 inneighbours of v outside V ex t available to embed 

, and so since |T~| < n/A < an/60, again by Theorem ll.2l we can embed among these 
vertices. If U' satisfies (b) we can embed T similarly, first embedding the T~ , and then the 

. Either way we have completed the embedding of T in G. □ 



5. Embedding trees in an almost-transitive tournament. 

A transitive tournament is a tournament in which the vertices can be given a total order 
so that every edge is directed towards the endvertex which is greater in this order. It is easy 
to show that any transitive tournament G on n vertices contains any directed tree T on n 
vertices, by first showing that the vertices of T can be given a total order so that every edge 
is directed towards the endvertex which is greater in this order, and then embedding each 
vertex of T to the vertex of G in the corresponding position (in the order of vertices of G). 

In this section, we shall prove an approximate version of this result, namely that if a 
tournament on (1 + a)n vertices (for some small a) is sufficiently close to being transitive, 
then it contains any directed tree on n vertices. To state this lemma precisely, we say that a 
tournament G on n vertices is e- almost-transitive if the vertices of G can be given an order 
v\, . . . ,v n so that at most en 2 edges are directed against the ordering of the vertices, that is, 
they are directed from vi to Vj where i > j. 

The proof of this lemma is by a similar method to the proof of Theorem 11.41 in the next 
section. The approach is that if the lemma is false, then there is some a > for which the 
lemma does not hold, and so the infimum a; n f of all a for which the lemma does hold is 
greater than zero. We then choose a slightly less than ai n f and apply (to a smaller subtree) 
the fact that the lemma holds for any a' > a; n f to show that the lemma holds for a, giving a 
contradiction. 

Lemma 5.1. For all a > there exists eo > and no £ N such that for any e < £q and 
any n > no, any e- almost-transitive tournament G on at least (1 + a)n vertices contains any 
directed tree T on n vertices. 

Proof. We consider the set A of all positive values of a such that the lemma holds. More 
precisely, A is the set of all positive values of a such that there exist eo > and no G N so 
that for any n > no and e < eo, any e-almost-transitive tournament G on at least (1 + a)n 
vertices contains a copy of any directed tree T on n vertices. So if a' G A and a" > a' then 
a" G A. Also 2 G A by Theorem 11.21 and so we may define ai n f = inf A, with < a; n f < 2. 
Then for any a' > a^, a' G A. With this definition the lemma is equivalent to the statement 
that a in f = 0, so suppose for a contradiction that a- m f > 0. Let 

7 <C 1/A <C a in f and a = a in f — 7, 

so we may assume that 1/A<a. Then a + 27 > a; n f, so a + 27 G A, and so by definition 
of A there exist e > and n G N such that for any e' < e' and n' > n' , any e'-almost- 
transitive tournament G on at least (1 + a + 2^)n' vertices contains a copy of any directed 
tree T on n' vertices. Moreover, we may assume that e <C 7. Fix such an e' and n' , and let 
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1/no <C l/n' ,7 and eq <C e' . We will show that for any n > no and e < £o, any e-almost- 
transitive tournament G on at least (1 + a)n vertices contains a copy of any directed tree T 
on n vertices. It then follows that a € A, yielding a contradiction and therefore proving the 
lemma. 

So let e < £o an d n > no, let G be an e-almost-transitive tournament on at least (1 + a)n 
vertices and let T be a directed tree on n vertices. If |G| > 3n, then G contains a a copy 
of T by Theorem 11.21 an d so we may assume that |G| < 3n. Since G is e-almost-transitive, 
we may order the vertices of G as vi, . . . , viq\ so that at most e|G| 2 < 9en 2 edges are directed 
from Vj to Vi where i < j. Now, at most 18y/en vertices of G are incident to more than \J~en 
such edges; let G' be the subgraph of G obtained by deleting these vertices from G, and let 
v[,v' 2 , . . . be the vertices of G' in the inherited order. Then G' is a tournament on at 

least (1 + a — 18yfe)n vertices such that for any vertex v\ there are at most \fen vertices v'j 
for which the edge between v\ and t/- is directed towards v' ■ r • -i • 

Next, let T c be the core tree of T with parameter A, as defined in Section T4.2I We consider 
three possibilities for T c , in each case showing that T can be embedded in G'. 

Case 1: Some vertex t G T c has d^ c (t) > 2. Then let F~ be the (possibly empty) forest 
consisting of each component subtree T' of T — t such that the edge between T' and t is 
directed towards t. Similarly let the component subtrees T" of T — t such that the edge 
between T" and t is directed away from t be partitioned into two forests, iv" and F 2 . Since 
df (t) > 2, by Proposition 14.2( h) at least two such component subtrees each contain at least 
n/A vertices, and so we may choose F^ and F 2 + so that \F^\,\F^\ > n/A. Note that 
\F~\ = w~(t), and \F^~\ + {F^l = w + (t), so in particular w + (t) > 2n/A, and also recall that 
w + (t) +w~(t) = n-l. 

We first determine where to embed the vertex t. For this, let 

J 37n + ^fen + 1 if w~ (t) < jn, 

P '~ \ (1 + a + 2j)w~ (t) + yfen + 1 if w~ (t) > -yn. 

and embed t to the vertex v' p of G' . This can be done, as we shall see later that p < \G'\. 
We will embed F~ in the vertices preceding v' and F x + , F 2 in the vertices succeeding v' p 
in the vertex ordering of G' . Embedding F~ will be possible because p is a little larger 
than one might expect, whereas embedding F^~ and F 2 + can be done successively, which 
will give us enough room for both. Let S~ = N~(v' p ) n {v[, . . . ,v' p _{\, and S + = N + (v' p ) n 
{v' p+1 , . . . , Then S~ and S + are disjoint, l-S - ] > p — ^fen — 1 and > \G'\—p— y/en. 

Next we shall embed F~ in CfS 1- ]. Indeed, if w~(t) < then \S~\ > 3~/n, and so by 
Theorem 11.21 we can embed F~ in Alternatively, if w~(t) > 771, let n' = w~(t) > n' 

and e 1 = \G\ 2 e/(n') 2 < e' , then F~ is a forest on n' vertices, and is an e'-almost- 

transitive tournament on at least (1 + a + 27)71' vertices. So by the choice of e' Q and n' Q we 
can embed F~ in G'[S~}. 

Finally we shall complete the embedding of T in G' by embedding F^~ and F 2 + in G / [S' + ]. 
Now, 

\S + \ > \G'\ -p-Jin 

> (1 + a - 18y/e)n - (3>yn + (1 + a + 27)^" (t) + y^n + 1) - yfen 

> (1 + a)w + (t) - 57n - 20^^ > (1 + a)w + (t) - 6jn. 

Let n' = \F^\, so n' < n/A < n' and n' < w + (t) — n/A, and again let e' = \G\ 2 e/(n') 2 , so 
e' ^ e 'o- Then G'fS 1 " 1 "] is an e'-almost-transitive tournament on > (1 + a)(n' + n/A) — 
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6777. > (1 + a + 1/A)n' + (a/A — 67)71 > (1 + a + 27)72' vertices, and so by our choice of n' 
and £q, we may embed F± in G"[S + ]. 

Now, let S+ m consist of the vertices of S + not occupied by the vertices of F±~. We shall 
embed F^ in S^ era in a similar manner. Indeed, we now let ri = \F+\, so again n < n/A < n' , 
and again let e' = \G\ 2 e/{n') 2 < e' . Then 

KeJ = \S + \ ~ \F+\ > (1 + a)w + (t) - 6 7 n - (w + (t) - \F+\) 

= (1 + a)n' + a\F+\ - 6771 > (1 + a + 2 7 )n', 

so G'fS'+jJ is an e'-almost-transitive tournament on at least (1 + a + 27)71' vertices, and so 
by our choice of n' and e' , we may embed F 2 + in G'[S+ m ]. 

Cose Some vertex t G T c /ias (t) > 2. Then we may embed T in G" by the same 
method as in Case 1, the main difference being that the roles of outdegrees and outneighbours 
are switched with those of indegrees and inneighbours. 

Case 3: T c is a directed path (possibly consisting of just a single vertex). Then let 
u>+ = w + (T c ) and w~ = w~(T c ) be as defined in Section [21 and partition the vertices 
of G' into three sets S~ = \v\ , . . . , v' _ /Q ), S = -|V„_ , v',^,, + /Q | and 

Li' ' w +cm/3 J ' L w +an/d+l , ' |G |— w + — an/S 1 

S + = { v '\G'\- w +-an/3+v •• • ' V '\G'\}- Tnen since w+ + w ~ + l^cl = w e know that |5| = 
\G'\ — w~ — w + — 2an/3 > \T C \. Therefore by Theorem 11.31 we may embed T c in Now, 
let T+, . . . , T r + be the component subtrees of T — T c such that the edge between and T c 
is directed towards T^, and for each i let tf G T c be the vertex of T c to which this edge is 
incident, and let vf be the vertex of G' to which tf was embedded. Similarly, let T-j - , . . . ,T~ 
be the component subtrees of T — T c such that the edge between and T c is directed to- 
wards T c , let t^ be the vertex of T c to which this edge is incident, and let be the vertex 
of G" to which t^ was embedded. Then every vertex of T lies in T c or one of the or . 
Furthermore |, < n/A for each 7 and j by Proposition I4.2( iv) . 

We shall complete the embedding of T in G" by greedily embedding each in N + (vf' )r\S + , 
and each T^ - in N~ (v~ )n S~ . Indeed, suppose we have already embedded Tf , . . . , Tf 1 , and 
we now wish to embed T^f . Then 

\N + (v+) n 5 + | > - Jen > w + + an/3 - ^fsn >w + + an/4,. 

At most w + of these vertices have already been occupied by vertices of T± , . . . , and so 
there remain at least an/ 4 available vertices in which to embed T^~ . Since \Tf~\ < n/A < 
an/12, we may embed in these available vertices by Theorem ll.2[ Continuing in this way 
we may embed all of the T^~, and the Tf may be embedded similarly, to give us a copy of T 
in G'. 

Any tree in which every vertex has at most one outneighbour and at most one inneighbour 
is a directed path. So T c must fall into at least one of the three cases, and so we can find 
a copy of T in G', and hence in G, contradicting our assumption that ai n f > 0. So we must 
have ainf = 0, and so the lemma holds. □ 

6. Proof of Theorem 11.41 
Recall the statement of Theorem 11.41 

Theorem 11.41 Let a > 0. Then the following properties hold. 

(1) There exists no G N such that for any n > no, any tournament G on at least 2(1 + a)n 
vertices contains any directed tree T on n vertices. 
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(2) Let A be any positive integer. Then there exists no £ N such that for any n > no, any 
tournament G on at least (1 + a)n vertices contains any directed tree T on n vertices 
with A(T) < A. 

The proofs of each of the two statements of the theorem are very similar, so to avoid 
repetition we shall prove the first statement, explaining in footnotes where the proof of the 
second statement differs. 

6.1. Partitioning the vertices of G. As in the last section, we consider the set A of all 
positive values of a such that the theorem holds. So a' E A if and only if there exists no such 
that for any n > no, any tournament on at least 2(1 + a')n vertices contains any tree on n 
vertices. So if a' £ A and a" > a' then a" £ A, and also 1/2 £ A by Theorem 11.21 Thus we 
may define a m t = inf A, and then the theorem is equivalent to the statement that di n f = 0. 
So suppose cii n f > 0, and choose constants 

1/no < l/n'o < u < v < n < 1/A' < 7 < a inf . 

Let a = ainf— /x, so a < 1/2, and we may assume that 7^0. Then a + 2/i E A, and so for any 
n ' > n' , any tournament on at least 2(1 + a + 1\i)n' vertices contains any tree on n' vertices. 
We shall prove that if n > no, any tournament G on at least 2(1 + a)n vertices contains any 
tree on n vertices. This proves that a £ A, giving a contradiction to our assumption that 
a inf > 0, and so proving the theoremQ 

So let G be a tournament on at least 2(l+a)n vertice^. If |G| > 3n then by Theorem II. 2 \ G 
contains any directed tree T on n vertices. So we may assume \G\ < 3n. We shall use an 
algorithm which keeps track of an ordered family S T of disjoint subsets of V(G), and a set B T 
of bad edges of G, at each time r. Initially, let 5 1 = (V(G)), and let B 1 = 0. Then at time 
r > 1, we have 5 r = (S{, . . . , S^), and the algorithm proceeds as follows. 

(1) Let SJ be a largest member of S T . If \SJ\ < jn, then terminate. 

(2) If G[SJ] is a robust (u, i/)-outexpander with 5°(G[SJ]) > nn, then terminate. 

(3) If some v S SJ has dg^ ST ^(v) < nn, then let 

= (si, su, sj \ w, M, sj +1 s;), 

let B T+1 = B T U E({v} -4 SJ \ {v}), and proceed to step (6). 

(4) Similarly, if some v £ SJ has fl^gr] («) < nn, then let 

= (si, . . . , sj-_ l9 w, sj \ M, sj +1 ,. . . , s;), 

let B T+1 = B T U ^(S^ \ {v} -4 {«}), and proceed to step (6). 

(5) If G[Sj] is not a robust (/i, z^)-outexpander then apply Lemma f2.8l to partition the 
vertices of Sj into sets S' and S" such that v\Sj\ < \S'\,\S"\ < (1 — ^l^jl and at 
most 4a\Sj\ 2 edges of G[Sj] are directed from S" to S' . Then let 

CT+l / QT QT Qf Qtf QT Q T \ 

O — {Ji , . • • , i~>£_i, J , J , lJ£+l, ■ ■ ■ , J T ) 

^For the bounded degree case, fix any value of A, and here A — A(A) is defined by a' £ A if and only if 
there exists no such that for any n > no, any tournament on at least (1 + a')n vertices contains any tree T 
on n vertices with A(T) < A. So if a' G A and a" > a' then a" G A, and also 2 G 4 by Theorem [O] Thus 
we may define ainf = inf A; then the theorem is equivalent to the statement that a[ n t = 0. So suppose ai n f > 0, 
and choose constants 1/no ^ V n o ^ v ^ rj <gi 1/A' <C 7^ 1/A, a] n f. Let a = ai n f — u, so a < 2, and 
we may assume that 7<a. Then a + 2/i G A, so for any n' > n , any tournament on at least (1 + a + 2^i)n' 
vertices contains any tree T on n' vertices with A(T) < A. Using this, we shall prove that if n > no, any 
tournament G on at least (1 + a)n vertices contains any tree T on n vertices with A(T) < A. This proves 
that a £ A, giving a contradiction to our assumption that Oi n { > 0, and so proving the theorem. 

2 For the bounded degree case, instead let G be a tournament on at least (1 + a)n vertices. 
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and let B T+l = B T U E(S" S'). 
(6) Finally, for each i £ [r + 1], delete from SJ +1 any vertex v which lies in more than 
^fr\n edges of B T+1 . 

At any step, if the algorithm does not terminate at step (1) or (2), then the condition 
of one of steps (3), (4) and (5) must hold. Therefore at each time r, either the algorithm 
terminates or |5 r | increases from r to r + 1 (in forming S T+1 ) by reducing the size of the 
largest piece. Therefore the algorithm must terminate at some time T en d 

< |G| < 3n. 

Now, at any time r at which the algorithm does not terminate, the algorithm will split 
the set SJ in precisely one of steps (3), (4) and (5). We next show that the split in step 

(5) will occur for at most times r < T cn d. This is because any set obtained by a split 
in step (5) must have size at least r yun (since \SJ\ > jn, and the sets S',S" obtained have 

\S"\ > v\Sg\), and so at most \G\/^un < such sets can be obtained. 
Next, we show that when the algorithm terminates at time T en <i, most vertices lie in one 
of the sets SJ , or equivalently that only a few vertices have been deleted. To do this, note 
that at each time r < r en ^, the number of edges added to form B T+l from B T is at most 
rjn if the algorithm carried out the split in step (3) or (4), and at most 4/i|G| 2 < 36 fin 2 if 
the algorithm carried out the split in step (5). Since r on( j < 3n, and the split in step (5) is 
carried out in at most of these steps, the number of bad edges at time r en( j is at most 

3nn 2 + 108/in 2 /vj < Ann 2 . Since B 1 C • • • C B Tcnd , any vertex of G which was ever deleted 
in step (6) must lie in at least Jr\n edges of B Tcnd , and so at most 8^/rjn vertices of G can 
have been deleted in step (6) over the entire course of the algorithm. Let G' be the restriction 
of G to the undeleted vertices at time r en d, so G' = G[|J 5 Tcnd ]. Then G' is a tournament and 

(6) \G'\ > \G\ - 8^rjn. 

Our approach now depends on whether the algorithm terminated in step (1) or (2). If the 
algorithm terminated in step (1), then for each i G [T cn d] we have |S'J end | < jn. We shall show 
that in this case G' is 27-almost-transitive. Indeed, order the vertices of G' as v\, V2, ■ ■ ■ , v \g'\ 
in the same order as in 5 Tend , i.e. beginning with all the vertices of S'{ ind , then the vertices 
of 5 < 2 end , and so forth. Then any edge Vj — > V{ where j > i either lies in B Tend or has both 
endvertices in the same S^ cnd . So the total number of such edges is at most 

A V n 2 + £ S 2 l ) < A V n 2 + £ ^ < A V n 2 + ^ < 2 7 n 2 . 

Since in both the unbounded degree case and the bounded degree case we have 

\G'\ > (l + a/2)n, 

by ©, G' is indeed 27-almost-transitive, and by Lemma 15. II G' contains a copy of T, which 
is also a copy of T in G. 

6.2. Partitioning the vertices of T. We may therefore assume that the algorithm termi- 
nated in step (2) at some time r en d; when for some S , J cnd with |S , J cnd | > jn, G[Sj end ] is a (//, v)- 
robust outexpander with 6° (G[S'[ ond ]) > nn. For this i, let S = Sl crLd , let S + = Ui<j< Tcnd S j cnd 
and let S~ = Ui<i<i ^J cnd - Then \S + U S~ U S\ = \G'\. Also, if u G 5+ and v G S U S~ then 
u G Sj cnd , v G Sp cnd for some j > £, and so if u — > v then this edge is in B Tcnd . So any vertex 
u G S + has at most ^/rjn outneighbours in S U S~ , since u was not deleted at any stage of the 
algorithm. Similarly each vertex of S has at most ^pr)n outneighbours in S~ and inneighbours 
in S + , and each vertex of S~ has at most ^/rjn inneighbours in S + U S. Define by 
\S\ = (3\G'\, \S+\ = [3 + \G'\, and \S~\ = P~\G'\, so [3 + /3+ + (3~ = 1 and (3 > jn/\G'\ > 7/3. 
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Suppose first that /3 + and /3~ are both small. More precisely, /3 + ,/3~ < a/3 2 /20, and so 
/3 > 1 — a/10. Then we shall find a copy of T in G[S] (and therefore in G). Indeed, T is a 
tree on n vertices, and G[S] is a (//, i^)-robust outexpander with (5°(G[S']) > r/n. Furthermore, 

G3 a a 

\S\ = /3\G'\ > (2 + 2a - 80?)/3n > (2 + a)(l - —)n > 2(1 + -)n 

and so by Lemma H~T1 G[5] (and therefore G) contains a copy of T0 

So we may assume that at least one of /3 + and /3~ is greater than a/3 2 /20, so in particular, 
(3 < 1 — a/3 2 /20. We next split the vertices of T according to the values of /3 + and /3 _ . 

Case 1: (3~ is large but /3 + is small. More precisely, j3 + < a/3 2 /20 and /3~ > a/3 2 /20. 
Then we partition the vertex set of T into T~ and T°, where every edge of T between T~ 
and T° is directed from to T , and |T~| = /3~(1 — a/3)n. We can form T° greedily by 
successively removing a sink vertex from T and adding it to T°. Since /3 + + /3 + /3~ = 1, 

|T°| = n - \T~\ = (3n{l + a - a/3) + (1 - a/3)/3 + n < /3n(l + a - a/3) + a/3 2 n/20. 

Cose 2; /3 + is fer^e but f3~ is small. More precisely, /3~ < a/3 2 /20 and /3 + > a/3 2 /20. Then 
we similarly partition the vertex set of T into T° and T + , where every edge of T between T° 
and T + is directed from T° to T + , and |T + | = /3 + (l — a/3)n. Again |T°| = n — |T + | < 
/3n(l + a - a/3) + a/3 2 n/20. 

Cose 5: /3 + and /3~ are 6oia /aroe. More precisely, /3 + ,/3~ > a/3 2 /20. Then we partition 
the vertex set of T into pieces T~, T° and T + such that all edges of T between T~ and T° 
are directed from T~ to T°, all edges of T between T° and T + are directed from T° to T + 
and all edges of T between T~ and T + are directed from T~ to T + . Also |T + | = /3 + (l — a/3)n 
and |T - | = /3"(1 - a/3)n, so |T°| = /3(1 + a - a/3)n. 

Note that in each of the three cases T° satisfies \T°\ > /3(1 + a - a/3)n and 

(7) \T°\ < /3(1 + a - ap)n + a/3 2 n/20 < 0(1 + a)n - ^y^. 

6.3. Embedding T in G. Having partitioned the vertices of G' into three sets S, S + and S~ , 
and the tree T into three forests T + , T°, T _ , we now complete the proof by embedding T in G, 
with T _ ,T° and T + embedded in GfS 1- ], and G[5 + ] respectively. Indeed, the fact that 
G[S] is a robust (fi, z^)-outexpander will enable us to embed slightly more vertices in G[S] than 
the j3n that would be embedded in G[S] if the vertices of T were distributed proportionately 
amongst G[S], G[S + ] and GfS 1- ]. This gives us some leeway for embedding T + and T~ in 
G[5 + ] and Gf/S - ] respectively, which by our choice of a is sufficient to successfully complete 
these embeddings. 

So let , . . . , T~ be the component subtrees of T~, let . . . ,T+ be the component 
subtrees of T + , and let T\,...,T Z be the component subtrees of T . Let the contracted 
tree T con be formed from T by contracting each T^", and Ti to a single vertex. 

To begin the embedding, we embed into G[S] every Tj satisfying \Ti\ > n/A'. Note that 
there are at most A' such Tj. Also, the union of all such Tj is a forest on at most \T°\ vertices, 
and the tournament G[S] is a robust (//, z/)-outexpander on 

/3|G'| > /3(2 + 2a - 8^j)n > 2 1 + |T°| > 2(1 + 7 2 )|T°| 



El 

^For the bounded degree case, |S| = /3\G'\ > (1 + a - 8y/ff)(l - a/10)n > (1 + a/A)n, and so G[S] (and 



therefore G) contains a copy of T by Lemma |3.1 
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vertices with dP(G[S]) > r/n, and hence G[S] contains a copy of this forest by Lemma l4.lFl 

Now, choose an order of the vertices of T con , beginning with the at most A' vertices cor- 
responding to the T which we have just embedded, and such that any vertex of T con has 
at most A' neighbours preceding it in this order. (To do this, choose one of the A' vertices 
corresponding to the Tj which have already been embedded, and then choose any ancestral 
ordering of the vertices of T con , beginning with the chosen vertex, so every vertex has at most 
one neighbour preceding it in this order. Now move the remaining A' — 1 vertices corre- 
sponding to the Tj which have already been embedded to the front of this order; then every 
vertex gains at most A' — 1 preceding neighbours.) We shall proceed through the remaining 
vertices of T con in this order, at each step embedding the tree Ti,T^ or Tf corresponding to 
the current vertex of T con in the unoccupied vertices of the tournament G[S], G[S + ] or G[S~] 
respectively. 

So suppose first that the current vertex t* of T con corresponds to some Tj. Since T has not 
already been embedded, we know that |T| < n/A'. Also, since t* has at most A' neighbours 
preceding it in T con , the vertices of T have at most A' neighbours outside T which have 
already been embedded. Since T is a component of T°, each of these neighbours of vertices 
in T lies either in T~ (in which case it is an inneighbour) or in T + (in which case it is an 
outneighbour). So let £7, ... , t~ be the vertices in T" which are inneighbours of some vertex 
in T and which have previously been embedded, and let v± , . . . ,v~ be the vertices of CfS' - ] 
to which £T,...,tp were embedded. Similarly, let tf,...,t+ be the vertices in T + which 
are outneighbours of some vertex in Tj and which have previously been embedded, and let 

, . . . , id" be the vertices of G'[S + ] to which t^, . . . , i+ were embedded. Finally let S* be 
the set of unoccupied vertices in S n N + (v^ , . . . , v~) n N~(vf, . . . , Then we wish to 
embed Tj in S*. For this, note that 



\S*\ > \S\ - (p + q)Vvn - \T°\ > P\G'\ - A'^n - (/3(1 + a)n - af3 2 n/2) 

> /3n(l + a) - (8 + A')y/rjn - (3(1 + a)n + a/3 2 n/2 > a/3 2 n/3 > 3n/A' > 3|Tj|. 

Note that this calculation is valid for both the bounded degree case and the unbounded degree 
case, with plenty of room to spare in the unbounded case. So by Theorem ll.2l G [»!?*] contains 
a copy of Tj, to which we embed Tj. 

Alternatively, if the current vertex of T con corresponds to some T~, then similarly the 
vertices of T[~ have at most A' neighbours outside T~ which have already been embedded, 
all of which are outneighbours. As before we let v\,. . . ,v r be the vertices of G'[S U S + ] 
to which these vertices have been embedded, and let S* be the set of unoccupied vertices 
of S~ n N~(vi, ... ,v r ). Note that at most |T _ | — \Tf\ vertices of T~ have already been 
embedded. Since some T~ exists we have 

m 

\S*\ > \S~\ -ry/fjn-(\T~\ ~\T~\) > /3~(2 + 2a)n — (8 + A')^/rjn — /3~(1 — a(3)n + \T~\ 
(8) >p-(l + 2a + aP/2)n + \T-\. 

In the final line we used the fact that f3~ > a(3 2 /20 and /3 > 7/3 (so 7], l/A' <C 7, P, (3~). So 
\S*\ > 2(l+a+2/i)|Tr|. Therefore if | Tf \ > P~n/2, then \Tf\ > a/3 2 n/40 > a7 2 n/360 > n' Q , 



For the bounded degree case, |5| > (1 + 7 2 )|T°| by a similar calculation, and so G[S] contains a copy of 
this forest by Lemma [3.11 
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and so we can embed T i in £?[»!?*] by our choice of nf Q . On the other hand, if | < f3 n/2 
then \S*\ > 3\Tr \ by (jSJ, and so we can embed T~ in G[S*} by Theorem rO0 

Finally, if the current vertex of T con corresponds to some T^~, we embed in the un- 
occupied vertices of S + by a similar method to the method used to embed some T~ in the 
unoccupied vertices of GfS 1- ]. We continue in this manner until we have embedded the Tj, 
or TT corresponding to each vertex of T con , at which point we will have obtained an embed- 
ding of T in G, completing the proof. At each stage in this proof we had 'room to spare' 
in our choices, and so the fact that the expressions for |TJ|, \T^~\ and \T~\ and other such 
expressions may not be integers is not a problem. □ 
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